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Abstract. Redshift maps of galaxies in the Universe are distorted by the 
peculiar velocities of galaxies along the line of sight. The amplitude of 
the distortions on large, linear scales yields a measurement of the linear 
redshift distortion parameter, which is /? ~ Og^/ft in standard cosmology 
with cosmological density fig and light-to-mass bias b. All measurements of 
P from linear redshift distortions published up to mid 1997 are reviewed. 
The average and standard deviation of the reported values is /^optical = 
0.52 ± 0.26 for optically selected galaxies, and Piras = 0.77 ± 0.22 for 
IRAS selected galaxies. The implied relative bias is ^optical /^ffiyl 5 ~ 1-5. If 
optical galaxies are unbiased, then f^o = O.SSIqJI ; while if IRAS galaxies 
are unbiased, then ^Iq = 0.63lo]27 ■ 



1. Introduction 

The organisers of this thoroughly enjoyable workshop asked me to write 
a review of redshift distortions aimed primarily at graduate students and 
others who are not familiar with the field. The review aims at fairly thor- 
ough coverage (up to mid 1997) within a rather limited scope: the subject 
of redshift distortions in the large scale, linear regime. The review does not 
attempt to cover the large body of work involving the direct measurement 
of peculiar velocities. The latter has been the subject of recent compre- 
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hensive reviews by Strauss k. Willick (1995), and by Dekel (1994). Both of 
those reviews included sections on redshift distortions. Nor does the present 
review cover nonhnear redshift distortions, except insofar as they affect hn- 
ear redshift distortions. For an entry to the hterature on nonhnear redshift 
distortions, try Davis, Miller k White (1997). 

Bubble's (1929) law states that the recession velocity cz of a galaxy is 
proportional to its distance d 

cz = Hod , (1.1) 

with constant of proportionality the Hubble constant Hq (the subscript 
signifies its present day value). The recession velocity cz of a galaxy can 
be measured from the redshift z of its spectrum (c is the speed of light), 
a great deal more easily and accurately than its true distance d. This has 
been a primary motivation for redshift surveys (see e.g. Strauss 1997 for a 
recent review) , which map the Universe in 3 dimensions using the recession 
velocity cz of each galaxy as a measure of its distance. 

Hubble's law is not perfect, however. Galaxies have peculiar velocities v 
relative to the general Hubble expansion. Thus it is necessary in general to 
distinguish between a galaxy's redshift distance s (conveniently expressed 
in velocity units) 

s = cz (1.2) 
and its true distance r (also conveniently expressed in velocity units) 

r = Hod . (1.3) 

The redshift distance s of a galaxy differs from the true distance r by its 
peculiar velocity v = r.v along the line of sight: 

s = r + v . (1.4) 

The peculiar velocities of galaxies thus cause them to appear displaced 
along the line of sight in redshift space. These displacements lead to red- 
shift distortions in the pattern of clustering of galaxies in redshift space. 
Although such distortions complicate the interpretation of redshift maps 
as positional maps, they have the tremendous advantage of bearing in- 
formation about the dynamics of galaxies. In particular, the amplitude of 
distortions on large scales yields a measure of the hnear redshift dis- 
tortion parameter (5, which is related to the cosmological density ^o, 
the present day ratio of the matter density of the Universe to the critical 
density required to close it, by (see §4.1) 

(1.5) 
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in standard pressureless Friedmann cosmology with light-to-mass bias b. 
The goal of much of the current work on redshift distortions is to mea- 
sure the linear distortion parameter f3, and perhaps, pious hope, if bias 
can be quantified through nonlinear effects or otherwise, to determine the 
cosmological density fio itself. 

1.1. PLAN 

The plan of this paper is as follows. 

Section 2 attempts to convey visually what redshift distortions look like 
and why, both §2.1 schematically, and §2.2 observationally. 

Section 3 is about power spectra. Sections 3.1 and 3.2 contains defini- 
tions, needed for subsequent reference, of correlation functions and power 
spectra in real and redshift space. Section 3.3 advertises the delights of 
Hilbert space. Section 3.4 explains why power spectra are better. 

Section 4 presents the theory of redshift distortions in the linear regime. 
The first part, §4.1, explores what 0, the linear redshift distortion parame- 
ter, really means. The second part, §4.2, derives the linear redshift distor- 
tion operator, which transforms real space into redshift space, for fluctua- 
tions in the linear regime. The third and fourth parts, §§4.3 and 4.4, cover 
two small but important details, the peculiar motion of us, the observers, 
and the difference between the selection functions in real and redshift space. 

Section 5 describes the three different types of method that have been 
used to date to measure /3 from linear redshift distortions: §5.1, the ratio 
of real to redshift angle-averaged power; §5.2, the ratio of quadrupole-to- 
monopole harmonics of the redshift power spectrum; and §5.3, the maxi- 
mum likelihood approach. 

Section 6 interjects an example of measuring (3 from linear redshift dis- 
tortions, partly as an illustration, partly to bring out the difference between 
optical and IRAS -selected galaxies, and partly to demonstrate the impor- 
tance of nonlinear ities. 

Section 7 describes the two methods that have been used to date to 
deal with nonlinearity: §7.2, a model in which linear redshift distortions 
are modulated by a random velocity dispersion; and §7.3, the Zel'dovich 
approximation. 

Section 8 compiles measurements of /3 from linear redshift distortions, 
complete up to the first half of 1997. Section 8.1 summarizes the measure- 
ments of /3 in a Table, and gives the average and standard deviation of 
the measurements, which results are the ones quoted in the abstract. Sec- 
tion 8.2 offers commentary on all the individual measurements. I apologize 
to authors whose work has been inadvertently omitted, or inadequately 
portrayed. 
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Figure 1. A spherical ovcrdensity appears distorted by peculiar velocities when observed 
in redshift space. On large (linear) scales the overdensity appears squashed along the line 
of sight, while on small (nonlinear) scales fingers-of-god appear. At left, the overdensity 
is far from the observer (who is looking upward from somewhere way below the bottom of 
the diagram), and the distortions are effectively plane-parallel. At right, the overdensity 
is near the observer (large dot), and the large scale distortions appear kidney-shaped, 
while the finger-of-god is sharpened on the end pointing at the observer. The observer 
shares the infall motion towards the overdensity. A similar diagram appears in Kaiser 
(1987). 



Finally, §9 discourses briefly on cosmological redshift distortions, which 
arise from differences in the geometry of the Universe perceptible at high 
enough redshift. 

The greater part of this review is just that, a review; but there are some 
new things here and there. The expression for the linear redshift distortion 
operator S*'"*^ for the practical case where (a) the redshift overdensity 
is measured in the Local Group frame, and (b) the selection function is 
measured in redshift space, also in the Local Group frame, appears here 
explicitly for the first time, equation (4.81). The analysis of the Stromlo- 
APM survey reported in §6 has not been published elsewhere. 



2. What Redshift Distortions Look Like 

2.L SCHEMATICALLY 

Figure 1 illustrates how a spherical overdensity appears distorted by pecu- 
liar velocities along the line of sight, when observed in redshift space. The 
initial spherical overdensity perturbation here was taken to be a power law 
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Figure 2. Detail of how peculiar velocities lead to the redshift distortions illustrated in 
Figure 1. The dots arc 'galaxies' undergoing infall towards a spherical overdensity, and 
the arrows represent their peculiar velocities. At large scales, the peculiar velocity of an 
infalling shell is small compared to its radius, and the shell appears squashed. At smaller 
scales, not only is the radius of a shell smaller, but also its peculiar infall velocity tends 
to be larger. The shell that is just at turnaround, its peculiar velocity just cancelling 
the general Hubble expansion, appears collapsed to a single velocity in redshift space. At 
yet smaller scales, shells that are collapsing in proper coordinates appear inside out in 
redshift space. The combination of collapsing shells with previously collapsed, virialized 
shells, gives rise to fingers-of-god. 



with radius, 5 oc r"^, located in an expanding Universe with critical mean 
density, = 1. The free-fall gravitational collapse of such a spherical pres- 
sureless overdensity can be computed analytically (Peebles 1980, §18). The 
dots (galaxies) started out uniformly distributed in the initial conditions, 
being uniformly placed around a series of uniformly spaced concentric shells. 
Thus the density of dots in Figure 1 indicates the density of galaxies in the 
collapsing overdensity, as observed in redshift space. Figure 1 omits shells 
that have collapsed to less than half their radius at turnaround, which shells 
may be expected to scatter off previously collapsed shells, and to virialize. 

Figure 2 shows how peculiar velocities produce the pattern illustrated 
in Figure 1. On large scales, peculiar infall towards the overdensity causes 
it to appear squashed along the line of sight. The squashing increases to 
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smaller scales down to the point of turnaround, where the peculiar infall 
velocity exactly cancels the general Hubble expansion. In the turnaround 
shell, the near and far parts of the shell appear collapsed to a single radial 
velocity in redshift space. At smaller scales, shells that have turned around 
and are collapsing in real space appear turned 'inside out' in redshift space. 
At even smaller scales (not shown in Figures 1 or 2), collapsed shells are 
expected to virialize. The combination of collapsing and virialized regions 
of galaxy clusters gives rise to fingers-of-god. 

2.2. OBSERVATIONALLY 

Fingers-of-god are well-known features of redshift surveys. Prominent ex- 
amples are the fingers-of-god in the Coma cluster (de Lapparent, Geller k, 
Huchra 1986) and the Perseus cluster (Wegner, Haynes & Giovanelli 1993, 
Figs. 7-10). 

The envelope of the fingcr-of-god in Figure 1 forms a caustic, a surface of 
infinite density (but finite mass). Such caustics arc not obviously seen in real 
fingers-of-god (Rcgos &: GcUcr 1989); presumably the caustics are smeared 
out by subclustering. The structure of the well-studied Coma cluster, for 
example, is quite complicated (CoUess & Dunn 1996). 

Visually, the large scale squashing effect is more subtle to discern in real 
data. Are prominent transverse structures such as the Great Wall (Ramella, 
Geller &; Huchra 1992) enhanced by redshift distortions? Probably yes, at 
some level (e.g. Praton, Melott &; McKee 1997). However, Dell' Antonio, 
Geller &: Bothun (1996) conclude from their analysis of the peculiar velocity 
field of the Great Wall that any infall is small, ^ 150 km s^^. 

The large scale squashing effect can however be detected statistically, 
from the distortion of the redshift space correlation function or of its 
Fourier transform the redshift space power spectrum P*. These quantities 
are defined formally in the next section, §3. Physically, the redshift correla- 
tion function ^^(s//, s±) is the mean fractional excess of galaxy neighbours 
of a galaxy at separations s// and s±_ parallel and perpendicular to the line 
of sight, a definition that suffices to understand Figure 3. 

Figure 3 shows redshift space correlation functions ^*(s//, s^) measured 
from two sets of redshift surveys, one (QDOT -|- 1.2 Jy) selected in the 
infrared, the other (Stromlo-APM) in the optical. The infrared survey is a 
merger of two Infrared Astronomical Satellite {IRAS) redshift surveys, the 
QDOT survey, and the 1.2 Jy survey. The revised QDOT survey (Lawrence 
et al. 1997, in preparation) is a redshift survey of a l-in-6 subset of galaxies 
brighter than 0.6 Jy from the IRAS Point Source Catalog (PSC). It con- 
tains 2376 galaxies over 9.29365 steradians, covering most of the sky above 
galactic latitude |6| > 10°. The IRAS 1.2 Jy survey (Fisher et al. 1995a; 
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Figure 3. Contour plots of the redshift space two-point correlation function as a 
function of separations s// and s± parallel and perpendicular to the line of sight in: (left) 
the IRAS QDOT and 1.2 Jy redshift surveys, merged over the angular region of the sky 
common to both surveys; and (right) the optical Stromlo-APM survey. In each case the 
region within 25^t^^Mpc of the Milky Way has been excluded, so as to eliminate bias 
from the local ovcrdensity. A near minimum variance pair weighting has boon applied. The 
thick contour signifies — 0, and other contours are logarithmically spaced at intervals 
of 0.5 dex above 10~^'® (the left panel also shows one negative contour, at — 10~^'^). 
Shading is graduated at intervals of 0.1 dex above 10^^^. The correlation function here 
has been smoothed over pair separation s = {s\^ + S//)^^^ with a tophat window of width 
0.2 dex, and over angles 6 = tan~^(sx/s//) to the line of sight with a Gaussian window 
with a la width of 10°. 



Strauss et al. 1992a) is a complete redshift survey of galaxies brighter than 
1.2 Jy from the same IRAS PSC. It contains 5321 galaxies over 11.02577 
steradians, covering most of the sky above galactic latitude |6| > 5°. For 
Figure 3, the surveys were merged over the part of the sky common to 
both surveys, yielding 5752 galaxies over 9.26740 steradians. Removing the 
region closer than 25/i~^Mpc, to avoid local bias (Hamilton &; Culhane 
1996, §2.1) left 4826 galaxies. The optical survey is the Stromlo-APM sur- 
vey (Loveday et al. 1996b), which is a redshift survey of a l-in-20 subset of 
galaxies brighter bj = 17.15 from the Automatic Plate Measuring (APM) 
survey (Maddox et al. 1990a,b, 1996). The Stromlo-APM redshift survey 
contains 1790 galaxies over 1.32467 steradians centred roughly around the 
South Galactic Pole. Eliminating the region closer than 25 /i~^Mpc left 1725 
galaxies. 

To ensure the validity of the plane-parallel approximation, the redshift 
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correlation function shown in Figure 3 was computed including only pairs 
closer than 50° on the sky, the line of sight to each pair being defined as the 
bisector (the angular midpoint on the sky) of the pair. A near minimum 
variance pair weighting was applied as described by Hamilton (1993b, §5, 
eqs. [60] & [61]). Further details of the procedure used here to calculate 
^^(s//, s±) are given in the latter paper. 

Figure 3 shows clearly the expected large scale squashing effect, while 
fingers-of-god show up as an enhancement of the redshift correlation func- 
tion along the line-of-sight axis. Besides these two expected effects, two 
other features are apparent. Firstly, the redshift correlation function of the 
optically selected galaxies is roughly a factor of 2 higher than that of the 
IRAS selected galaxies. Secondly, the fingers-of-god are longer and more 
prominent in the optical galaxies. Both these features can be attributed 
to the fact that IRAS galaxies, which arc diisty, gas-rich spirals, avoid the 
centres of rich clusters of galaxies, where ellipticals rule. 

A contour plot of the redshift space power spectrum from an iV-body 
simulation can be found in Cole, Fisher & Weinberg (1994, Fig. 1), while 
Bromley, Warren & Zurek (1997, Fig. 1) show plots of the redshift space 
power spectrum as a function of ^ = tan~^ {k±/k//) from two large (17 
million particle) high resolution simulations. 

3. Correlation Functions and Power Spectra 

Sections 3.1 and 3.2 collect standard definitions of correlation functions and 
power spectra, in real and redshift space respectively, needed elsewhere 
in this review. Compared to theoretical correlation functions, defined in 
§§3.1.1 and 3.2.1, observed correlation functions contain additional noise, 
the dominant contribution to which is often taken to be Poisson sampling 
noise, also known as shot noise, discussed in §§3.1.2 and 3.2.2. I adhere 
throughout this review to what has become the standard convention in the 
field of galaxy clustering for defining Fourier transforms, notwithstanding 
the extraneous factors of 27r that result. 

Section 3.3 points out how Hilbert space provides a compact, powerful 
notation. 

Section 3.4 explains what makes power spectra special. 
3.1. REAL SPACE 

A fundamental proposition is that the three dimensional distribution of 
matter in the Universe constitutes a statistically homogeneous and isotropic 
random density field. This proposition, a reflection of the proposition that 
the Universe at large is homogeneous and isotropic — Milne's 1933 Cosmo- 
logical Principle (see Peebles 1980, §3B) — is powerfully evidenced by the 
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isotropy of the Cosmic Microwave Background (Bennett et al. 1996; Gorski 
1997). 

The statistical properties of such a field are completely determined by its 
irreducible moments, or correlation functions as they are commonly called 
in the disciphne of large scale structure (Peebles 1980, Ch. III). The first 
two irreducible moments are the mean, a constant, and the covariance, or 
2-point correlation function, a function of separation. 

In the case of a multivariate Gaussian random field, for which by defi- 
nition the third and all higher correlation functions vanish, the mean and 
2-point correlation function completely specify the statistical properties of 
the field. If fluctuations generated in the early Universe were the result 
of a superposition of many independent random processes, as for exam- 
ple quantum fluctuations during inflation, then the Central Limit Theorem 
guarantees that the fluctuations will form a multivariate Gaussian. Fluctu- 
ations on large, linear scales today would then also be Gaussian. Available 
observational evidence is consistent with fluctuations being Gaussian on 
linear scales (Stirling k. Peacock 1996; Kogut et al. 1996). 

Once density fluctuations grow large, they cannot remain Gaussian, 
since density must remain positive. Thus on small, nonlinear scales the third 
and higher order correlation functions must be non-vanishing. Nonetheless 
the covariance, the 2-point correlation function, is well-defined in the non- 
linear regime, and remains a statistic of fundamental importance. 

3.1.1. The True Density Field of the Universe 

Let p{r) denote the density of matter at (real, not redshift) position r in 
the Universe, and let p, a constant in space, denote the mean density. The 
density p{r) may be regarded as the density of mass, or of particles, or of 
galaxies, or of some particular set of objects one is interested in. Whatever 
the case, the density p{r) signifies the true density, not the observed den- 
sity of objects that may be recorded in a real survey (which is considered 
in §§3.1.2 and 3.2 following), and r is the true position, not the redshift 
position. The true overdensity 6{r) at position r is defined by 

5{r) ^ . (3.1) 

It is to be noted that although 5{r) is defined to be the 'true' overden- 
sity, it is nevertheless a hypothetical quantity, to be distinguished from a 
quantity, such as the redshift, angular position, or fiux of a galaxy, which 
is directly observed. This is the usual situation in statistics, where one 
imagines a hypothetical 'true' population from which observed data are 
drawn. The properties of statistical homogeneity and isotropy attributed 
to the 'true' density field are likewise theoretical hypotheses, whose validity 
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can be tested against observed data, though never proven absolutely. The 
distinction between observed data and theoretical models thereof appears 
starkly in the maximum likelihood procedure, §5.3. 

The most basic statistic that can be constructed from the overdensity 
(the mean having been subtracted off) is its variance, its second irreducible 
moment, the correlation function ^(ri2) (also known as the 2-point cor- 
relation function, or 2-point function, or covariance function, or autocovari- 
ance function), 

ari2) = (5(ri)(5(r2)) . (3.2) 

Equation (3.2) states that the expectation value of the product of overdensi- 
ties at a pair of randomly positioned points separated by ri2 in the Universe 
is ^(ri2). More physically, the correlation function ^(ri2) is the mean over- 
density of neighbours around a random particle (galaxy) (Peebles 1980, 
§31). The assumption that the density field is statistically homogeneous 
and isotropic means that the correlation function ^(ri2) is a function only 
of the scalar separation ri2 = \ri — r2\ of the points ri and r2, not of their 
overall location or orientation. 

The Fourier transform of the overdensity defines the Fourier modes S{k) 
at wavevector k (hats denote Fourier transforms throughout this review) 

S{k) = J e'^-'^S{r)d^r , 5(r) = J e''^-^ Sik) d^k/{2Trf . (3.3) 

The power spectrum is by definition the covariance of Fourier modes, 
which from the definitions (3.3) and (3.2) is equal to the Fourier transform 
of the correlation function 

{diki)d{k2)) = J e'^'-''^+'^'-'''^{n2)d\id\2 . (3.4) 

Since the correlation function (,{r\2) is a function only of separation, equa- 
tion (3.4) reduces to^ 

{5{ki)5{k2)) = {2Tif5D{ki + k2)P{ki) , (3.5) 
where P{k) is also called the power spectrum, 

P{k) = j (^^■'^i{r)d^r , i{r) = j e''^-'^ P{k)d^k/{2TTf . (3.6) 

^It is also fine to define the power spectrum as the covariance of modes 
with one of the modes taken to be the complex conjugate, in which case 
{5{k\)5* {ki)) = (27r)^fo(fei — k2)P{ki). The equivalence of the two definitions is made 
clear in §3.3. The advantage of the symmetric choice (3.5) becomes more apparent 
when dealing with higher order correlation functions, such as the 3-point function 
(5(fci)5(fe2)5(fc3)). 
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The 'momentum-conserving' 3-dimensional Dirac delta function (^0(^1+^2) 
in equation (3.5) expresses the assumed translation invariance, i.e. statisti- 
cal homogeneity, of clustering, while the fact that P{k) is a function only 
of the absolute value k of the wavevector k expresses statistical isotropy. 

It is useful also to give here results in spherical transform space, in which 
the density field is expanded in spherical harmonics about the observer. Let 
Sim{i^) denote spherical modes in real space 




{dor denotes an interval of solid angle in real space) and similarly let 8irn{k) 
denote spherical modes in Fourier space 

5tm{k) = / Y,^{k)d{k)dof^ , d{k) = Y,yim{k)5tm{k) (3.8) 
■' em 

{doj^ denotes an interval of solid angle in Fourier space) where are the 
usual orthonormal spherical harmonics. The spherical transforms in real 
and Fourier space are related by 

Sem{k) = /47r / ji{kr)6im{r)r'^dr , 
Jo 

POO 

Semir) = r^47r / ji{kr)Sem{k) k^dk/{2Trf (3.9) 
Jo 

where je{kr) arc spherical Bessel functions. The reality conditions 6*{r) = 
S{r), hence 6*{k) = 6{—k), along with the usual properties = 
{—)"^Y£_jn and Y^^(— fc) = (— )^l£m(fe) of the spherical harmonics, imply 

SUr) = i-rk-mir) , 5Uk) = {'Y^^'k-mik) • (3.10) 
The correlation function of spherical modes in real space is 

{Se,mAri)Serrn,ir2)) = i-r'Sf^i,€,^-mMn,r2) (3.11) 

where 6^ is the Kronecker delta, and the reduced correlation function 
^^(^,'"2) is related to the power spectrum P{k) by 

POO 

e^(ri,r2) = (47r)2 / u{kn)h{kr2)P{k)k'dk/{2'Kf . (3.12) 
Jo 

The extraneous minus signs in (3.11), and also in the next equation (3.13), 
disappear if one takes the complex conjugate on one of the overdensities, 
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as in (66*) in place of {66); see §3.3 for clarification of this point. The 
correlation function of spherical modes in Fourier space is 

{6e,mAki)6e,m,{k2)) = {-Y'+"''6f^,X^^_^^{2nf6D{h-k2)k^''P{ki) 

(3.13) 

where — ^2) is the 1-dimensional Dirac delta-function, satisfying 

/ 6r){k)dk = 1 integrated over any interval containing the origin A; = 0. As 
explained in §3.3, the expression in front of P{k\) on the right hand side of 
equation (3.13) is just the unit matrix in Mm-space, equation (3.42). 

3.1.2. The Observed Density Field and Shot Noise 

Real surveys probe only a portion of the density field of the Universe, 
and what they do survey is liable to be an imperfect representation of the 
true density field. This subsection discusses what is often assumed to be 
the dominant source of noise in a survey, which is Poisson sampling noise, 
often also called shot noise. It is assumed in this subsection that the data 
lie in real space, not redshift space. Corresponding results in redshift space 
are given in §3.2.2. 

Typically, a galaxy survey does not include all galaxies in a region of 
space, but only, say, those brighter than some flux limit. Thus a survey 
generally blends a finer sampling of dim galaxies nearby with a sparser 
sampling of luminous galaxies farther away. A survey is characterized by its 
selection function n(r), which is the expected mean number of galaxies 
at position r given the selection criteria (e.g. the flux limit) of the survey. 

The selection function n(r) of a survey must be measured from it. The 
problem of measuring the selection function of a survey is reviewed by 
Binggeli, Sandage &: Tammann (1988), and is discussed further in §4.4. 
Some additional comments on the measurement of the selection function 
appear in the maximum likelihood section, §5.3, in the paragraph following 
equation (5.18). 

In order to combine heterogeneously sampled regions, the (testable) hy- 
pothesis is commonly made that the galaxies observed are drawn randomly 
from a hypothetical continuous underlying population of galaxies. The ob- 
served galaxies then form a Poisson process on the underlying population, 
and the selection function n(r) is interpreted as specifying the probability 
(in units of number of galaxies per unit volume) of including a galaxy at 
position r into the survey. 

Let n{r) denote the observed number density of galaxies at (real, not 
redshifted) position r in a survey. The number density n(r) is a sum of delta 
functions, since galaxies come as discrete units. The observed galaxy 
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overdensity 5obs(^) is then defined by 

_ n{r) - n(r) 
<^°bs(^) = • (3-14) 

In the Poisson process model, the observed galaxy overdensity (^obs('") pi'o- 
vides a discretized but unbiased estimate of the true overdensity S(r), equa- 
tion (3.1). The subscript obs is retained throughout this section 3 to distin- 
guish the estimate Jobs from the hypothesized true value S, but is dropped 
from most of this review to avoid an overly ponderous notation (correctly, 
estimated values not only of 6 but also of all other measured quantities 
should be distinguished from their 'true' values). It should be clear from 
the context whether what is meant is the theoretical 'true' value of a quan- 
tity, or an observational estimate thereof (the theoretical sections 4, 5, and 
7 refer largely to theoretical quantities, while the observational sections 6 
and 8 report mainly estimates thereof). 

In the Poisson process model, the expectation value C{ri,r2) of the 
covariance of observed overdensities is a sum of the true correlation function 
^(ri2) with a Poisson sampling noise, or shot noise, term: 

(^obs(ri)5obs(r2)) = C(ri,r2) =C(ri2) + (5D(ri -r2)[n(ri)]-i . (3.15) 

The form ^^(ri —r2)[n{ri)]~^ of the Poisson sampling term can be derived 
from the following argument. First, note that Jy n{ri)n{r2)d'^rid'^r2 = 
Jy n{ri)d^ri = or 1 when the integration is over an infinitesimal volume 
V, which either does not (0) or does (1) contain a galaxy. It follows that 
the expectation value of the product of densities in the same infinitesimal 
volume element is {n{ri)n{r2)) = 5d(^i — 'r'2){n{ri)) = doifi — 'r2)n{r\)^ 
whence (5obs('"i)<^obs('"2)) = ^oiji — i'2)'^ki'^\~^ i which is the shot noise 
term of equation (3.15) as claimed. The Poisson sampling term reflects the 
fact that the probability of finding yourself as a neighbour at zero separa- 
tion is unity. The shot noise becomes infinite in regions outside a survey 
where the selection function is zero, which makes sense. 

Equation (3.15) shows that, in the Poisson process model, the expecta- 
tion value of the survey covariance is equal to the true correlation function 
at any finite separation 

C(ri,r2)ri^r2 =e(n2) . (3.16) 

Thus any average (5obs('"i)<^obs('"2)) of products of pairs of overdensities 
at any finite separation r\2 = — T2I 7^ 0, weighted in any arbitrary a 
priori fashion, provides an unbiased estimate of the true correlation function 

e(n2). 
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In estimating the correlation function from a survey, the shot noise 
contribution can be eliminated by excluding from the computation all self- 
pairs of galaxies (pairs consisting of a galaxy and itself). Alternatively, it 
may be convenient to include self-pairs, and to subtract off the shot noise 
as a separate step. 



3.2. REDSHIFT SPACE 

Redshift space quantities (distinguished in this review by a superscript 
s) are defined analogously to real space quantities. Let n*'(s) denote the 
observed number density of galaxies at redshift position s in a redshift 
survey (the observer is at the origin s = 0). 

A slightly subtle point in the definition of overdensity S^{s) in redshift 
space arises because, as emphasized by Fisher, Scharf & Lahav (1994), the 
apparent brightness of a galaxy depends on its true distance r, not its 
redshift distance s, so that the selection function of a flux- limited redshift 
survey is correctly a function n(r) in real space, not redshift space. This 
suggests that one might define the observed overdensity in redshift space 
by (^obs(*) = ['^''(*) ~ n{r)]/n{r). This is possible, but it requires knowing 
not only the true selection function n(r) in real space, but also the true 
distance r to each galaxy, which involves carrying out a full reconstruction 
of the deredshifted density field (Yahil et al. 1991; Fisher et al. 1995b; 
Webster, Lahav & Fisher 1997), whereupon one might as well work with 
the overdensity (5obs('') in real space. 

There remain two alternative possibilities for defining the redshift over- 
density. The first option is to define the observed redshift space galaxy 
overdensity S^^g{s) at redshift position s relative to the real space selection 
function n(s) evaluated at the same redshift position s 

KM - ■ (3.17) 

This is the definition adopted by Kaiser (1987), and also adopted in §4.2. 

A drawback of the definition (3.17) is that it can be tricky to estimate 
the real space selection function n(s) from a redshift survey, whereas it is 
relatively straightforward to estimate the redshift space selection function 
n*(s) (see footnote^). Thus a second possibility is to define the observed 
redshift space overdensity S^^g{s) [with a double ss superscript to distin- 
guish it from 5Qjjg(s)] relative to the redshift space selection function n*(s) 

*XW-=^^4^. (3.1«) 

^Hamilton & Culhane (1996) claim that the real and redshift selection functions agree 
to linear order, but this is false. See the paragraph just before §4.4.1. 
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The relation between the real and redshift space selection functions 
n(s) and n^{s), and the consequent relation between the redshift space 
overdensities S^{s) and S^^{s), is discussed in §4.4. 

3.2.1. The True Density Field in Redshift Space 

As in real space, in redshift space it is necessary to distinguish between the 
observed redshift overdensity 5^^^g{s), equation (3.17), and the hypothetical 
'true' redshift overdensity 6^{s) whose existence is predicted theoretically. 
For fluctuations in the linear regime, the theoretical prediction is that the 
true redshift overdensity S^{s) is related to the true unredshifted overden- 
sity 6{r) by equation (4.22), derived in §4.2. Similarly, corresponding to 
the observed redshift overdensity 6^^^^{s), equation (3.18), is a hypothetical 
'true' redshift overdensity S^^{s) predicted by equation (4.73). Correspond- 
ing predictions for the redshift overdensities S^^^ and ^^^^c; measured in 
the Local Group frame instead of the Cosmic Microwave Background frame 
are given by equations (4.45) and (4.79). The definitions in this subsection 
remain valid for all the various flavours of the redshift overdensity: just 
change the superscripts appropriately. 

Unlike the true overdensity 6{r) in real space, which is defined indepen- 
dent of the selection function of the survey, the true overdensity S'^{s) in 
redshift space depends on the selection function. This is fine, except that 
an ambiguity arises for regions outside the survey, where the selection func- 
tion is zero. Specifically, the true redshift overdensity 6^{s) depends on the 
selection function n(r) through the quantity a{r) = dlnr^n{r)/dlnr in 
the redshift distortion operator (4.23). The quantity a{r) is zero divided 
by zero outside the survey, where the selection function is zero, so is am- 
biguous. 

Two comments can be made about this ambiguity in the definition of the 
true redshift overdensity <5*(s) outside the survey. Firstly, one can choose to 
resolve the ambiguity in whatever manner is convenient, and this choice has 
no effect whatsoever on the comparison between theory and observation. 
This is because the ambiguity in a(r) occurs only at points outside the 
survey, whereas comparison between theory and observation is made only 
at points inside the survey. A convenient choice in a flux-limited survey, 
where the selection function n(r), hence a{r), is a function only of depth 
r, not of direction r within the angular boundaries of the survey, is to take 
a{r) to have the same value outside the angular boundaries as within. The 
advantage of this choice is that it preserves the angular symmetry of the 
'true' redshift correlation function about the observer. Beyond the radial 
extent of the survey, a{r) could be set to zero, or any other convenient 
choice. 

The second comment is that the ambiguity disappears in the plane- 
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parallel, or distant observer, limit, where the a term in the distortion op- 
erator goes to zero. In the plane-parallel limit, the 'true' redshift space 
overdcnsity S^{s) becomes independent of the selection function, and the 
'true' plane-parallel redshift space correlation function and power spectrum 
are correspondingly independent of the selection function. This is the con- 
text in which the concept of a redshift space power spectrum is in any case 
most often considered (§4.2.1). 

Suppose hereafter then the true redshift space overdensity d^{s) has 
been defined through all space, the ambiguity outside the boundaries of 
the survey being resolved if possible as described in the previous two para- 
graphs. The redshift space correlation function ^^(si2, si, S2) is defined 

by 

e{si2,Si,S2) = {d'{si)d'{S2)) . (3.19) 

Redshift distortions partially destroy the symmetry enjoyed by the unred- 

shiftcd correlation function ^(ri2), so that the redshift correlation function 
C^isu, Si, S2) is a function of the redshift distances si and S2 as well as the 
separation S12 of a pair of galaxies. Redshift distortions do however preserve 
the rotational symmetry of the correlation function about the position of 
the observer^. 

If the angle between the positions Si and S2 of the galaxy pair is 
small enough, then the line-of-sight redshift distortions are effectively plane- 
parallel, as illustrated in the left panel of Figure 1, and the redshift cor- 
relation function reduces to a function only of the components s// and 
s± of the pair separation S12 respectively parallel and perpendicular to the 
line of sight z 

eisi2,si,S2)^e{s//,s±) . (3.20) 

The Fourier transform of the true redshift space overdensity defines the 
redshift Fourier modes 6^{k) (here the necessity for 6^{s) to be defined 
everywhere, including outside the survey, is apparent) 

§'{k) = J e''^-^5'{s)dh , 5'{s) = J e-'^-^6'{k)d^k/{2TTf . (3.21) 

The covariance of redshift Fourier modes defines the redshift space power 
spectrum, which is equal to the Fourier transform of the redshift correla- 
tion function 

{6\ki)5\k2)) = J e'^^-^^+'^'-^'e{si2,si,S2)dhid^S2 . (3.22) 

^ More correctly, the redshift correlation function has orientation symmetry provided 
that the selection function is independent of direction over the parts of the sky surveyed 
(which need not be the whole sky), as in a redshift survey with a uniform flux limit. A se- 
lection function that is variable with direction over the sky destroys orientation symmetry 
about the observer, through the a(r) term in the redshift distortion operator (4.23). 
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The line-of-sight redshift distortions destroy statistical homogeneity, so that 
the redshift power spectrum is no longer a diagonal matrix. The residual 

orientation symmetry about the observer^ implies that the redshift power 
spectrum is a function only of scalar combinations of its arguments 

{5'{ki)5'{k2))=ti\ki + k2\,ki,k2) . (3.23) 

In the plane-parallel approximation, however, redshift distortions do 
preserve statistical homogeneity, and in that case the redshift power spec- 
trum is again a diagonal matrix 

{6'{ki)6'{k2)) « {27rf6D{ki + k2)P'{ki//,ki^) , (3.24) 

where ki// and ki± are the components of the wavevector fci respectively 
parallel and perpendicular to the line of sight z, and 

P%k//, k^) = I e'^-^eis//, s^_) . (3.25) 

Spherical modes (5|^(s) and S^^{k) in redshift space are defined analo- 
gously to equations (3.7) and (3.8). Redshift distortions preserve rotational 
symmetry about the observer (but see footnote 3), so the redshift corre- 
lation functions remain diagonal with respect to the angular indices £m. 
Equations (3.7)-(3.11) take the same form in redshift space, aside from the 
addition of s superscripts. The relation (3.12) is modified to 

C|(si,S2) = (47r)2 / ji{kiSi)je{k2S2mki,k2)kldkikyk2/{2Trf (3.26) 
Jo 

and the correlation function of spherical modes in Fourier space becomes 

{SlmMSlmM) = {-Y''''^'5f^t,€u-mM{kl. ^^2) ■ (3.27) 

3.2.2. The Observed Density Field and Shot Noise in Redshift Space 
As in real space, in redshift space the observed galaxy density is subject 
to Poisson sampling noise, or shot noise. In the Poisson process model 
described in §3.1.2, the expectation value C*(si,S2) of the covariance of 
observed redshift space overdensities 5^^^{s) defined by (3.17) is a sum of 
the true redshift space correlation function ^*(si2, sij 52), equation (4.32), 
with a Poisson sampling noise term 

('^obs(«i)<5obs(«2)) = C'{si, S2) = f (si2, Si, S2) + Sd{si - S2)[n(si)]-^ 

(3.28) 

Again, orientation symmetry is strictly true only if the selection function is uniform 
within the angular boundaries of the survey, because of the a(r) term in the redshift 
distortion operator (4.23). 
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which may be compared to the corresponding equation (3.15) in real space. 
The coefficient [n(si)]~^ of the Dirac delta-function in the Poisson sam- 
pling term is valid for a flux-limitcd redshift survey; the coefficient could 
conceivably be different if there were some criterion other than a flux 
limit for selecting galaxies into the redshift survey. The form of the Pois- 
son sampling term can be derived by an argument similar to the one in 
real space which followed equation (3.15). An important part of the argu- 
ment in redshift space is that the expectation value of the product of red- 
shift densities in the same infinitesimal volume element is {n^{si)n^{s2)) = 
Sd{si — S2)(n*(si)) = (5d(si — S2)n(si), the last step of which comes from 
the fact that, in a flux-limited redshift survey, the ensemble-averaged ex- 
pectation value of the redshift density at a point is equal to the real (not 
redshift) selection function, (n*(s)) = n(s). 

Similarly, the expectation value C**(si, S2) of the covariancc of observed 
redshift space overdensities (^obs(*) defined by (3.18) is a sum of the cor- 
responding true redshift space correlation function ^^^{si2, 81,82), equa- 
tion (4.83), with a Poisson sampling noise term 

{KL{si)StL{s2)) = C^'{si,S2) = e'{si2,si,S2) + 6n{si - ^2)0^ . 

(3.29) 

Again, the coefficient n(si)/[n*(si)]^ of the Poisson samphng term is valid 
for a flux-limited redshift survey. The form of the coefficient follows from 
the same argument as that following equation (3.28). 

Equation (3.28) shows that, in the Poisson process model, the covariance 
of observed redshift space overdensities at any finite separation si ^ S2 
provides an unbiased estimate of the true redshift correlation function, 

C'{si,S2)s,^s^ = r (si2, Si, S2) , (3.30) 

similar to the corresponding result (3.16) in real space. Similarly, from 
equation (3.29), C^'^ {81,82)3-1^82 — ^^^(sia, ^i, S2). As in real space, the 
shot noise contribution to the survey covariance in redshift space can be 
removed by excluding all self-pairs, consisting of a galaxy and itself. 

3.3. HILBERT SPACE 

It can be confusing to keep track of the mess of 27r's and other factors, and 
the juxtaposition of discrete and continuous quantities, in equations such 
as (3.13). Hilbert space offers a compact, unifying formalism that takes 
care of the bookkeeping and provides much insight. It is not the place of 
this subsection to present a complete account of Hilbert space; rather, the 
goal is to demonstrate how concepts familiar from quantum mechanics also 
prove powerful in the present statistical context. 
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The overdensity can be regarded as a vector S in an infinite-dimensional 
vector space, Hilbert space. This vector has a meaning independent of the 
basis, i.e. complete set of linearly independent functions, with respect to 
which it is expanded. Thus for example the overdensity vector d has compo- 
nents Sr [= S{r)] when expanded with respect to a basis of delta functions 
in real space, and it has components Sj^ [= S{k), but now the hat is dropped 
from Sf^ in recognition of the fact 5r and (5^ are really the same vector] 

when expanded with respect to a basis of Fourier modes e~*^'^; but from 
a Hilbert space point of view the vector 6 remains unchanged: only the 
'coordinate system', the basis of expansion functions, changed. This is in 
precise analogy to ordinary finite-dimensional vectors, which have a mean- 
ing independent of the particular coordinate system used to locate them. 
Similarly, the covariance matrix 



(not an inner product!) can be regarded as a matrix in Hilbert space, with, 
for example, components ^rir2 real space, or ■Cfc^fcj Fourier space. 

The quintessential property of Hilbert space is the existence of an inner 
product, or scalar product, of any two vectors a and b, which maps the two 
vectors into a scalar, a (complex-valued, in general) number. This scalar 
value is independent of any basis with respect to which the vectors may be 
expressed. Various compact notations exist for writing the scalar product: 



The first expression in (3.32) is the Dirac bra-ket notation, in which the 
ket I a) denotes a vector and the bra (a| its Hermitian conjugate, the com- 
plex conjugate transpose of the vector (the angle brackets ( ) here are not 
to be confused with statistical averages; in the Dirac notation, a vertical 
line always separates the bra and the ket). The middle expression is ma- 
trix notation, in which the vector denotes the Hermitian conjugate of 
the vector a, and Tr signifies the trace. The final expression of (3.32) is 
the inner product in component form: the quantities arc the compo- 
nents of the vector a with respect to some orthonormal basis of functions 
(see eqs. [3.34], [3.36]), while the quantities a* = a* with raised index de- 
note their Hermitian conjugates, and there is implicit summation over the 
indices i (the similarity to tensor notation in general relativity is not co- 
incidental). It is to be understood that the index i may be either discrete 
or continuous, or a combination of both, and that in the continuous case 
summation signifies integration with respect to some defined measure of i; 
examples of the latter are seen immediately below. Other variants of the 
inner product notation also occur, such as the omission of the trace sign 




{a\b) = TTa^b = a%i . 



(3.32) 
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TV in the third expression, if there is no ambiguity, or the insertion of an 
exphcit summation or integration over i in the last expression. In practice 
it may be typographically convenient to keep all indices lowered, it being 
understood that when doubled indices occur, one of the pair is implicitly 
raised, signifying the Hermitian conjugate. 

In a statistically homogeneous 3-dimensional random field, equal volume 
elements d^r have equal statistical 'weight', so it is logical to define the inner 
product of two (complex-valued, in general) functions a(r) and b{r) as an 
integral over (fr. With respect to the various bases encountered in §3.1, 
the inner product of a(r) and 6(r) can be written 



(a|6) = j a*{r)h{r)Sr = j a*{k)h{k)d^k/{2Tif (3.33) 
= Y.^lmir)hUr)r^dr= ^imik)bem{k)k^dk/ {27: f . 

The equality between the real and Fourier integrals, the second and third 
expressions in equation (3.33), is Parseval's theorem, an apparent triviality 
in the Hilbert formalism. Note that if the inner product is complex- valued, 
then the order of the inner product matters: the inner product in the op- 
posite order is its complex conjugate, {b\a) = {a\b)* . 

A basis of functions tpi{f) is said to be orthonormal if their inner prod- 
ucts constitute the unit matrix (see eq. [3.38]) 

J A{r)^{r) d\ = ll , ^ V'(r)V'i(r') = fe(r - r') (3.34) 

i 

in which either equality implies the other, given that il^i{r) constitute a ba- 
sis, a complete linearly independent set. In index notation, equations (3.34) 
are 

V',Vr = l^, vvv'/' = ir- (3.35) 

The orthonormal functions ipi{rj) can be regarded as the rows of an or- 
thonormal matrix "^ij, satisfying ^'^^ = "^^^ = 1. The components of 
a vector a with respect to the orthonormal basis ipi{r) are given by 



j il)i{r)a(r) 



dV, a{r) = Y,^\r)ai . (3.36) 



In index notation, these equations are 

Oj = ipi^ar , ar = ip'^r'^i ■ (3.37) 

The rule that raising a lowered index i (or lowering a raised index i) 
means take the complex conjugate works fine for vectors, but the rule be- 
comes less easy to interpret for a matrix, where there is more than one 
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index. In practical calculations it is easier to apply an equivalent set of 
rules, valid for matrices that are real-valued in real space (as is usually the 
case for matrices of physical interest, such as the covariancc matrix {SS), 
eq. [3.31], or the linear redshift distortion operator S, eq. [4.23]). The rules, 
which depend on the basis, are that raising (or lowering) an index i: 

o in real space does nothing; 

o in Fourier space transforms ki —kf, 

o in r^m-space transforms rrii —rrii and multiplies by (— )™'; 
o in Hm-space transforms rrii —nT'i and multiplies by (— )^i+™<. 

The unit matrix ij , whether for discrete or continuous indices, is defined 
by the property that 

ijaj = ai (3.38) 

for any vector Cj. From this definition together with the definition (3.33) of 
the inner product, it is straightforward to determine that the unit matrix in 
real space is (the following equations [3.39]-[3.42] give both the unit matrix 
and its expression with both indices lowered) 

1^' = Soir ~ r') , Irr' = Snir - r') . (3.39) 
Similarly the unit matrix in Fourier space is 

1^' = (27r)3fe(fc - k') , 1^^/ = (27r)^fe(fe + k') . (3.40) 

The unredshifted power spectrum (3.5) is thus recognized as being a diag- 
onal matrix, with eigenvalues P{k). In spherical transform space with the 
radial index in real space, the unit matrix is (abbreviating /x = rim) 

^^J.' — xK f:K - ''') 1 _/ \m^K - r') (oA^\ 
^IJ. — %'Omm' ^ ' ^HH' — \~) %'Om,-m' ^ • l-^-^lj 

In spherical transform space with the radial index in Fourier space, the unit 
matrix is (abbreviating /x = Mm) 

,H'->:K.K {^T^)^SD{k - k') _ .e+^KjrK {2Tr)^SD{k - k') 
^IJ, — p ' -'■/'At' ~ %'Om,-m' p " 

(3.42) 

Thus again the unredshifted power spectrum (3.13) in kim-space is seen to 
be a diagonal matrix, with eigenvalues P{k). 

There is no need to go further here into the Hilbert space formalism, 
which can be learned from any quantum mechanical textbook (e.g. Landau 
&; Lifshitz 1958). The advantage of the formalism is that it frees expressions, 
such as the relation (4.22) between overdensity in real and redshift space, or 
the expression (5.17) for the Gaussian likelihood function, from being tied to 
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any particular basis, such as real space or Fourier space. If such expressions 
are written in component form (i.e. the vectors and matrices are written 
with indices), then any index can be regarded as referring to any basis 
of functions, possibly a different basis for each index, with the following 
two provisos: (1) indices and the bases to which they refer should match 
between the left and right hand sides of an equation; (2) wherever paired 
indices occur, indicating an inner product, they should be evaluated in the 
same (arbitrary) basis. In evaluating an inner product, it suffices to bear 
in mind two rules. First, one of a pair of indices i should always be raised 
(implicitly, if not typographically) indicating the Hermitian conjugate; in 
the case of Fourier space for example, this means using — fcj for one index 
and +ki for the other (at least for vectors and matrices that are real- 
valued in real space). Second, if the paired index is continuous, then the 
volume clement of integration is consistently the same, in accordance with 
the definition of the inner product in the basis used; in Fourier space for 
example, the volume element is d^fci/(27r)^, according to equation (3.33). 

3.4. MERIT OF THE POWER SPECTRUM 

Blackman & Tukey (1959, §B.3) in their classic text on power spectra state 
"The autocovariance function is of little use except as a basis for measuring 
the power spectrum" . The power spectrum of large scale structure in the 
Universe was apparently first measured by Yu &; Peebles (1969) and Peebles 
(1973). 

The fundamental advantage of the (true, unredshifted) power spectrum 
is that the Fourier modes of a statistically homogeneous random field are 
uncorrelated: their covariance matrix {S{ki)5{k2)) is a diagonal matrix, 
equation (3.5). Modes that are uncorrelated are said to be statistically 
orthogonal. The statistical orthogonality of Fourier modes docs not imply 
that they are independent in general, but it does imply that they are inde- 
pendent in the important case of a multivariate Gaussian field, for which 
by definition the 3-point and all higher correlation functions are zero. 

Note that the advantage of the power spectrum over (for example) the 
spatial correlation function is not that it contains any more information — 
the power spectrum and the correlation function have precisely the same 
information content — but rather that the power spectrum parcels the 
information into uncorrelated chunks. 

The statistical orthogonality of the (true, unredshifted) Fourier modes 
is intimately associated with the assumed translation invariance of the sta- 
tistical properties of the density field. As is familiar from quantum mechan- 
ics, the translation operator, the generator of an infinitesimal translation, is 
—id/dr, which is also the momentum operator k (Landau & Lifshitz 1958, 



LINEAR REDSHIFT DISTORTIONS 



23 



§15). The eigenfunctions of the translation operator —id/dr are just the 
Fourier modes 5{k), with eigenvalues k. The operator of an overall trans- 
lation is —id/dR = X] —id/dr, which is the same as the total momentum 
operator K = ^k. The sum here is over all relevant coordinates, e.g. 
the 2 coordinates of the 2-point correlation function. Statistical homogene- 
ity means that the statistical properties, the correlation functions, of the 
density field are unchanged by an overall translation, d^/dR = 0, or equiv- 
alently = 0. Thus the correlation functions have zero total momentum, 
K = Y^k = ss m. equation (3.5). 

Similarly, the statistical orthogonality of (true, unredshifted) spherical 
modes with respect to angular indices £m, equations (3.11) and (3.13), 
is intimately associated with statistical isotropy about the observer (or 
indeed about any other point). The operator of an infinitesimal rotation 
is the angular momentum operator Z, whose eigenmodes are the spherical 
harmonics l^m. Statistical isotropy (about some point) implies that the 
correlation functions have zero total angular momentum (about that point), 
X = = 0, as in equations (3.11) and (3.13). 

Redshift distortions destroy translation invariance, but they preserve 
isotropy about the observer (but sec footnote 3), and hence also preserve 
the statistical orthogonality of spherical modes with respect to the angular 
indices Im. Thus, as emphasized by Fisher, Scharf k. Lahav (1994) and 
Heavens & Taylor (1995), the advantage of the power spectrum can best 
be preserved in redshift space by working in spherical harmonics. 

The statistical orthogonality of Fourier modes windowed through any 
finite survey (i.e., if the overdensity is multiplied by the, possibly weighted, 
selection function of the survey) is destroyed by the fact that the survey 
is not translation invariant, especially at wavelengths approaching the size 
of the survey. Similarly, the statistical orthogonality of spherical modes 
windowed through a non-all-sky survey is destroyed by the break down of 
rotational symmetry. However, the desirability of measuring quantities that 
are uncorrelated remains. Ways to construct statistically orthogonal modes 
in real surveys are described by Vogeley k. Szalay (1996), Tegmark, Taylor 
& Heavens (1997), and Tegmark et al. (1997), and ways to decorrelate power 
spectra are described by Hamilton (1997), and Tegmark et al. (1997). 

4. Theory of Linear Redshift Distortions 

The theory of linear redshift distortions was greatly clarified in a fundamen- 
tal paper by Kaiser (1987). Although the existence of large scale redshift 
distortions and the possibility of using them to measure the cosmological 
density VLq had been recognized previously (Sargent &; Turner 1977; Pee- 
bles 1980, §76B), the complete, correct equations describing linear redshift 
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distortions were first derived by Kaiser. 

The linear distortion equations are presented in §4.2. First, however, it 
is useful to discuss what the linear distortion parameter /3, which is the 
quantity actually measurable from linear redshift distortions, and the focus 
of much of the fuss, actually means. 

4.1. THE LINEARIZED CONTINUITY EQUATION 

I vividly recall a conversation with Jerry Ostriker (in 1991, I believe) in 
which I was excitedly explaining how one could measure the cosmological 
density from redshift distortions in the linear regime. Jerry, a fan of ex- 
plosive cosmologies (Ostriker &; Cowie 1981), was unimpressed. He pointed 
out that redshift distortions are no more than a consequence of the conti- 
nuity equation, that the contimiity equation is of widespread applicability, 
and that what redshift distortions measure is not 0,q but rather (exact 
quote) "a dimensionless number of no particular significance" . 

Jerry's view may be on the blueshifted side of the spectrum of perspec- 
tives, but his point is well taken. Physically, to grow an overdense region, 
galaxies have to move in the general direction of the ovcrdcnsity. More pre- 
cisely, the value of (3 that is measured from redshift distortions in the linear 
regime is the value that solves the linearized continuity equation 

/3(5 + V.u = 0. (4.1) 

Actually, an additional proposition enters here, which is that the peculiar 
velocity field v is irrotational (has zero vorticity), as predicted by grav- 
itational growth theory. These matters are explained below. The reader 
who wishes to gain insight into the problem is invited to solve the problem 
illustrated in Figure 4. 

When dealing with dynamics in cosmology, it is convenient to work in a 
comoving coordinate system, which expands with the general Hubble 
expansion of the Universe. To maintain contact with observational reality, 
it is also useful, as in §1, to express comoving distances in velocity units 
evaluated at the present day, using the present day values Hq of the Hubble 
constant and oq of the cosmic scale factor to relate comoving distance and 
velocity. Thus let r denote the comoving (unredshifted) position of a galaxy, 
measured in velocity units at the present time (r is related to Peebles' 1980 
§7 comoving distance a; by r = Hoaox). For example, the comoving distance 
to the Coma cluster is T ^ 7000 km s ^, as inferred from its measured 
redshift; by definition of comoving coordinates, this comoving distance does 
not change (much) as the Universe expands. 



LINEAR REDSHIFT DISTORTIONS 



25 




Figure 4- Problem: From the linearized continuity equation (4.1) show that, in the hnear 
regime, the pecuhar velocity v of a particle (galaxy) at the present time is related to its 
comoving displacement Ar (measured in velocity units) since the Big Bang hy v = pAr . 
In deriving this result you will need to assume that the density field is initially uniform, 
and that the velocity is the gradient of a potential. Amongst other things, you should 
find that in the linear regime particles move in straight lines. 



The proper peculiar velocity v of a galaxy is its proper velocity in 
the comoving frame 

^ ^ adr _ dr ^^^^ 
Hoaodt dr 

where t is proper time, r is the dimensionless conformal time defined by 
dr = Hoaodt /a, and a(T) is the cosmic scale factor. One of the advantages 
of working with conformal time is that objects (e.g. photons) moving at 
the speed of light c have peculiar velocities dr/dr = c at all times. 

At this point it is necessary to distinguish carefully between matter, 
which satisfies a continuity equation, and galaxies, which, thanks to galaxy 
formation and later merging, may not. The continuity, Euler, and Poisson 
equations for cold (pressureless) matter (subscripted M) in a perturbed 
Friedmann-Robertson- Walker Universe are, expressed in the comoving co- 
ordinate system (Hui &; Bertschinger 1996), 

+V.{l + dM)vM = (4.3) 



dr 

daVM 
adr 



+ VM-VvM = -V<?!) (4.4) 



V <^ = 77272 = ^77272 (4-5) 

where V = 9/(9r is the comoving gradient, pM oc is the proper mean 
matter density, and the cosmological matter density Qm and Hubble con- 
stant H both evolve in time. In the linear regime, where \Sm\ -C 1, the 
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continuity and Euler equations reduce to 



dr 

daVM 



+ V.VM = (4.6) 
= -V(^ . (4.7) 



adr 

If the peculiar velocity is decomposed (as can always be done) into its 
gradient (irrotational, or longitudinal, or scalar) and curl (rotational, or 
transverse, or vector) parts, 

VM = v'it + v^ , (4.8) 

in which the longitudinal part v^f = 'Vip is the gradient of some scalar 
potential -0, while the transverse part v'^f = V x A is the curl of some 
vector potential A, then the linearized Euler equation (4.7) becomes 

^ = _V<^, ^ = 0. (4.9) 
a or a or 

The second of equations (4.9) shows that the curl part of the velocity decays 
as oc as the Universe expands, so should be sensibly equal to zero in 
the absence of a mechanism to generate vorticity. If only gravity operates, 
then the peculiar velocity in the linear regime should be pure gradient. 

When the linearized continuity (4.6), linearized Euler (4.7), and Pois- 
son (4.5) equations are combined, the result is a second order linear differ- 
ential equation for the overdensity Sm, which, when further combined with 
the unperturbed solution for the evolution a(r) of the cosmic scale factor, 
leads to growing and decaying solutions (Peebles 1980, §§10-13; Padman- 
abhan 1993, §4.5) 

5Mir,T)^D{T) (4.10) 

which evolve in time without change of shape. The interesting solution is 
the unstable, growing solution, and D(t) below is taken to be the linear 
growth factor of the growing mode. The linearized continuity equation (4.6) 
can then be written 

+ V.VM = (4.11) 

where / is the dimensionless linear grow^th rate of the growing mode 

_^_HQaQdlnD dlnD (4 12) 

Ha dr din a 
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In the standard pressureless matter-dominated cosmology, the dimension- 
less growth rate / is an analytic function of J7m with a celebrated approx- 
imation as a power law^ (Peebles 1980, eq. [14.8]) 

/{Qm) ~ . (4.13) 

Lahav et al. (1991) give an approximation in the case where there is a 
cosmological constant^ so that the total cosmological density is a sum of 
matter and cosmological constant parts, O = VLm + ^^a: 

f{^M,^K)-^^'d + ^[^ + ^) ■ (4.14) 

Evidently the growth rate / depends mainly on the matter density ^m, 
and is only weakly dependent on the cosmological constant. 

Equation (4.11) is the continuity equation for the matter; to go from 
this to the continuity equation (4.1) for galaxies involves one further issue 
— bias. 

For more than a decade it has been apparent that galaxies may not con- 
stitute an unbiased tracer of the underlying matter density. If galaxies were 
an unbiased tracer of the matter, they would, by definition, satisfy 5 = 5m 
(see footnote''). That some bias exists, at least in some galaxy populations, 
follows from the fact that galaxies selected in different ways have correla- 
tion functions with different amplitudes (e.g. Peacock & Dodds 1994; Oliver 
et al. 1996; Peacock 1997). For example, the correlation function of optical 
galaxies is approximately a factor of 2 larger than that of IRAS galaxies, 
a fact evident from Figure 3, which demonstrates that optical galaxies and 
IRAS galaxies cannot both be unbiased tracers (Oliver et al. 1996). 

The simplest model of bias postulates that the galaxy overdensity S is 
linearly biased by a constant factor, the linear bias factor b, relative to 
the underlying mass density Sm, so that 

S = b6M , (4.15) 

® Occasionally a small fuss is made about the exponent of the power law in this ap- 
proximation (cf. Lightman & Schechter 1990). A sensible person would use the exact 
analytic expression for /(f2M) when reporting actual results. 

^Einstein's cosmological constant A, which could arise from a non-zero vacuum density 
pvac, is related to Q.a by A = SyrGpvac = SH^Qa ■ 

^Since galaxies come as discrete units, one should understand this equation in a prob- 
abilistic sense. Galaxies are unbiased if the probability of finding a galaxy in a volume 
element dV at position r is proportional to the amount of matter pM{r)dV in that volume 
element. The discretized galaxy density n{r) is then said to be a Poisson process, with 
mean proportional to pM{r). The statistical properties of such a discretized distribution, 
i.e. its 2-point and higher correlation functions, coincide with those of the underlying 
matter density field aside from the addition of deltarfunction discreteness terms at zero 
separation (cf. §3.1.2). 
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whereas galaxy velocities faithfully follow the velocity of matter 

V = vm ■ (4.16) 

The linear biasing model (4.15) was originally motivated by threshold 
biasing, in which galaxies are supposed to form only where the matter 
density exceeds a certain threshold (Kaiser 1984), and by peaks biasing 
where galaxies form at the peaks of the matter density (Bardeen et al. 1986). 
These models predict that, at least in some regimes, the galaxy correlation 
function ^(ri2) = {S{ri)6{r2)) should be amplified over the matter corre- 
lation function ^^^(^^12) = (<Jm('"i)'5m('"2)) by an approximately constant 
factor 

^{ri2)-b^^Miri2) . (4.17) 

What is true observationally is that the correlation functions of galaxies se- 
lected in different ways differ mainly in their amplitudes, but only weakly in 
their shapes (Peacock Sz Dodds 1994; Peacock 1997). Ultimately, it remains 
unclear how nature chooses to bias galaxies in reality. 

The linearized continuity equation (4.11) for the matter, evaluated at 
the present time, together with the linear bias model (4.15), yield the lin- 
earized continuity equation (4.1) for galaxies, where the dimensionless quan- 
tity P is related to the present day value /o of the linear growth rate /, 
equation (4.13) or (4.14), and the bias factor b, equation (4.15), by 

(3 = ^. (4.18) 

With the standard formula (4.13) for /, this becomes the oft-quoted rela- 
tion (1.5). 

If the matter peculiar velocity vm is curl-free in the linear regime (see 
the argument following eq. [4.9]), and the galaxy velocity is unbiased, equa- 
tion (4.16), then the galaxy peculiar velocity field v is also curl-free in the 
linear regime. The linearized galaxy continuity equation (4.1) then implies 
that the galaxy peculiar velocity v is related to the galaxy overdensity S at 
the present day by 

V = -PW'^S (4.19) 

where is the inverse Laplacian. All measurements of /3 in the linear 
regime are in effect measurements of the ratio of the peculiar velocity v 
to the galaxy overdensity S, the (testable) relation (4.19) being assumed to 
hold true. This is true not only in the case of large scale redshift distortions, 
but also for measurements of /? from direct comparison of overdensities and 
peculiar velocities (Strauss & Willick 1995), or from comparison of the 
peculiar motion of the Local Group of galaxies to the dipole anisotropy of 
galaxies over the sky (Strauss et al. 1992b). 
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Two final points about bias are worth making. Firstly, the linear bias 
model (4.15) must break down in the nonlinear regime (if 6 > 1, as gener- 
ally expected), since otherwise equation (4.15) would predict that regions 
empty of matter, 6m = — 1, would have negative galaxy density, which is 
impossible. Various models of nonlinear biasing can be contemplated (Pry 
8z Gaztaiiaga 1993; Mann, Peacock Sz Heavens 1997). 

Secondly, if a linear bias bi is established so S{r,ti) = hiSMif^ti) at 
some initial time ij, after which galaxies satisfy continuity (being neither 
created nor destroyed), so that [l-|-(5(r)]/[l+(5M(T')] oc n{r)/ pm{^) remains 
constant in Lagrangian elements after the initial time ti, then the bias factor 
b = 5/6m evolves. During linear growth, ISm] <S 1, the bias decreases in 
time while remaining constant in space, 

m - 1 = ^{bi - 1) , (4.20) 

where D{t) / D{ti) = SM{f,'t)/^M{f,'ti) is the linear growth factor. Equa- 
tion (4.20) shows that the action of continuity is to drive the bias factor 
closer to (but not necessarily all the way to) unity, as fluctuations grow. If 
continuity persists into the nonlinear regime, then the bias factor becomes 
a function also of position, 

b{r, t)-l = SM{r, U) (j^^^^ + l) {bi - 1) (4.21) 

saturating at 6 = 1 in underdense regions (as (5Af(r,t) —1)) and at 
6=1-1- 5M{r,ti){bi — 1) in overdense regions (as SM{r,t) oo). In the 
nonlinear regime, streams from several different initial points may overlap 
at the same final point, so correctly equation (4.21) should be averaged 
over streams. 

The model described in the previous paragraph is of course simplistic. 
The point is that the bias factor b probably evolves, and that there is a 
mechanism — continuity — that tends to drive the bias factor closer to 
unity, that is, to make galaxies less biased as time goes on. 

4.2. THE LINEAR REDSHIFT DISTORTION OPERATOR 

In the linear regime, the overdensity in redshift space is related to the 
overdensity 6 in real space by a linear redshift distortion operator S, 

d' = S5. (4.22) 

Conceptually, it is helpful to recognize that the linear redshift distortion 
operator S is just a linear operator (a matrix in Hilbert space), much like the 
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linear operators one encounters in quantum mechanics. In real (as opposed 
to Fourier, say) space, the distortion operator, derived immediately below, 
is an integro-differential operator (the inverse Laplacian is the integral 
part) 

S-./^(|i.^^)v- (4.23) 

where a(r) is the logarithmic derivative of times the real space selection 
function n(r), 

dlnr'^nir) , ^ 

«(^) = pi • (4.24) 

amr 

The redshift distortion operator (4.23) is valid in the frame of reference of 
stationary observers, those at rest with respect to the Cosmic Microwave 
Background (CMB), and provided that the redshift overdensity 5*(r) is de- 
fined relative to the real space selection function n(r), as in equation (3.17). 
The modifications to the linear redshift distortion operator S for the cases 
where the redshift overdensity is defined in the Local Group frame, and/or 
relative to the selection function ff{r) measured in redshift space, are given 
by equations (4.46), (4.75), and (4.81). 

The linear redshift distortion equations were first derived by Kaiser 
(1987, §2). The derivation below follows most closely that of Hamilton h 
Culhane (1996, §2). 

The starting point of the derivation of the linear redshift distortion 
operator (4.23) is a conservation equation for galaxies, which expresses the 
fact that peculiar velocities displace galaxies along the line of sight, but 
they do not make galaxies appear or disappear (this conservation equation 
is not the same as the dynamical continuity equation discussed in §4.1) 

n\s)d^s = n{r)d^r . (4.25) 

In terms of the redshift space overdensity 5*(s) defined by equation (3.17), 
the galaxy conservation equation (4.25) is 

n(s)[l + b\s)\ s^ds = n(r)[l + 5(r)] r'^dr . (4.26) 

With the relation between redshift position s and real position r 

s = r + vr , (4.27) 

equation (4.26) rearranges to 

1 + 5\s) = (l + ^y\l + 5{r)] . (4.28) 
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So far equation (4.28) is exact, valid in the linear and nonlinear regimes, 
at least until orbit-crossing occurs. 

The next step is to linearize equation (4.28). In addition to the linear 
theory assumption \S{r)\ -C 1, which also implies \dv/dr\ <C 1 if the veloc- 
ity V is given by linear theory, equation (4.31), it is necessary to assume 
that peculiar velocities v of galaxies are small compared to their distances 
r from the observer (actually, this assumption can be dropped, if the red- 
shift space overdensity d^{r) in the CMB frame is constructed according to 
equation [4.52]; see §4.3.2), 

\v\<^r . (4.29) 

To linear order equation (4.28) then reduces to (note in particular that 
(5*(s) = 5^{r) to linear order) 

^^■(r)=5(r)-(|:-f^). (4.30) 

where a(r) is defined by (4.24). In linear theory, the peculiar velocity v is 
given by equation (4.19), so the line-of-sight component v of the peculiar 
velocity in the CMB frame is 

V = -f3-^V-^S . (4.31) 
or 

Inserting this formula into equation (4.30) yields the distortion equa- 
tion (4.22) with the redshift distortion operator S given by equation (4.23), 
as was to be proven. 

It follows from equation (4.22) that the redshift correlation function 
^*(ri2, ri, r2), equation (3.19), in the linear regime is related to its unred- 
shifted counterpart ^(^12) by 




SiS2e(n2) (4.32) 
a{ri)d\ 2 

I -I- / ) I — - 

r2dr2 I 



2 + Th^)vr 



drf ridri 




C(n2) ■ 



This equation is correct for observers who are located randomly in the 
Universe, who are at rest with respect to the CMB, and who define the 
redshift overdensity relative to the real space selection function n(r), as in 
equation (3.17). As discussed in §3.2.2, in a real redshift survey, where the 
overdensity field is sampled by discrete galaxies, the correlation function 
of the observed overdensity field contains an additional shot noise term, 
equation (3.28). 

We, sitting on a galaxy, the Milky Way, are not at a random location, 
but reside in an overdense region of the Universe, as galaxies are wont to 
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Figure 5. A wave of amplitude 5{k) in real space (thin line) appears as a wave with 
enhanced amplitude 5''{k) in redshift space (thick line) because of peculiar velocities 
(arrows). If the wavevector fc is along the line of sight, then the amplification factor is 
1 + /3. More generally, a wave that is angled to the line of sight appears amplified in 
redshift space by a factor 1 + /Sm^i equation (4.35), where = z.k is the cosine of the 
angle between the wavevector fe and the line of sight z. 

do. To guard against 'local bias', it would a reasonable precaution to excise 
at least part of the local region when analysing a galaxy survey. 

Furthermore, we are not at rest in the CMB. What to do about that is 
discussed in §4.3. 

Finally, it is much more straightforward to measure the selection func- 
tion n^(r) in redshift space than in real space. What to do about that is 
discussed in §4.4. 

4.2.1. The Linear Plane-Parallel Redshift Distortion Operator 
In the plane-parallel, or distant observer, limit, the linear distortion oper- 
ator (4.23) reduces to (the superscript p denotes plane-parallel) 

SP = 1 + /3^V-2 (4.33) 

where z is distance along the line of sight. In Fourier space, {d/dz)'^'V~'^ = 
kl/k"^ = /Lt^, where = z.fc is the cosine of the angle between the wavevec- 
tor k and the line of sight z. Thus in Fourier space the plane-parallel dis- 
tortion operator reduces to a diagonal operator 

S^' = l + /34, (4.34) 

so that, as illustrated in Figure 5, a Fourier mode 5^{k) in redshift space is 
simply equal to the unredshifted mode 5{k) amplified by a factor 1 -|- 



5'{k) = {l + p^il)5{k) 



(4.35) 
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It follows from equation (4.35) that, in the plane-parallel approximation, 
the redshift space power spectrum P^{k) is amplified by (1 + /9m|.)^ over 
its unredshifted counterpart P{k) 

P%k) = {l + ^^ilfP{k) , (4.36) 

an elegant formula first pointed out by Kaiser (1987, eq. [3.5]). 

The simplicity of formula (4.36) arises from the fact that the plane- 
parallel distortion operator is diagonal in Fourier space, which is inti- 
mately associated with the fact that plane-parallel redshift distortions pre- 
serve translation invariance. The translation operator — iV commutes with 
the plane-parallel distortion operator, and therefore the eigenmodes of the 
translation operator — iV, which are precisely the Fourier modes 5{k), are 
also eigenmodes of the plane-parallel distortion operator. 

Translation invariance means that redshifted Fourier modes remain sta- 
tistically orthogonal in the plane-parallel approximation, equation (3.24), 
preserving the advantage of the unredshifted power spectrum (§3.4). 

4.2.2. The Linear Radial Redshift Distortion Operator 
Unfortunately, the full distortion operator (4.23), which I call here the ra- 
dial redshift distortion operator^ to distinguish it from the plane-parallel 
distortion operator (4.33) or (4.34), proves to be quite a bit more compli- 
cated to deal with than the plane-parallel operator. The radial operator 
destroys translation symmetry, so that Fourier modes are no longer eigen- 
modes of the distortion operator, and the redshift space power spectrum 
{5^ {ki)5^ {k2)) is no longer a diagonal matrix. The linear radial distortion 
operator in Fourier space is an integral operator, a continuous matrix, 

S(fc,fe') = y"e'^-^S(r)e-'^'-^dV (4.37) 
= {2T,)HD{k - k') [l + p^k^V^'k-^^ - (3a{k - k')-^k'Vpk'-'' 

where Vj^ = d/dk, and a(fc) = / a{r)e'^^''^ d^r is the Fourier transform 
of a{r). The distortion operator (4.37) yields the redshifted Fourier modes 
5^{k) when acting on the unredshifted Fourier modes 5{k) 

5%k) = J S{k,k')S{k')d^k'/{2Trf . (4.38) 

The form of the linear radial distortion operator in Fourier space is dis- 
cussed by Zaroubi & Hoffman (1996). 

^Hamilton & Culhane (1996) call it the spherical distortion operator. 
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The radial redshift distortion operator does however preserve angular 
symmetry about the observer^, so that at least some of the merits of the 
power spectrum can be retained by working in spherical harmonics. In 
particular, spherical harmonic modes remain statistically orthogonal with 
respect to the angular indices in redshift space (§3.4), a point emphasized 
and taken advantage of first by Fisher, Scharf & Lahav (1994), and then 
by Heavens & Taylor (1995). Mathematically, the distortion operator S 
commutes with the angular momentum operator L, so that the eigenmodes 
of the angular momentum operator, which are the spherical harmonics Yim, 
are also eigenmodes of the distortion operator. 

Hamilton & Culhanc (1996, §4) point out that the linear radial dis- 
tortion operator (4.23) is nearly scale-free, to the extent that the quan- 
tity a{r) is approximately constant. If follows that the distortion oper- 
ator commutes approximately with the operator d/dlnr, the logarith- 
mic derivative with respect to depth r, or equivalently with the opera- 
tor d/dlnk = —d/dlnr — 3, the logarithmic derivative with respect to 
wavenumbcr k. A complete set of commuting Hcrmitian operators for the 
spherical distortion operator S is then, to the extent that a{r) is constant, 

The operators and are the square and z-component (along some arbi- 
trary axis) of the angular momentum operator L = irxd/dr = ikx d/dk, 
which is the same operator in real and Fourier space. The radial eigenfunc- 
tions are radial waves in logarithmic depth r or wavenumber k, while the 
angular eigenfunctions are the usual orthonormal spherical harmonics Y^^. 
Thus the eigenfunctions of the commuting set (4.39) are spherical waves 
with radial parts in logarithmic real or Fourier space. 

There are two drawbacks to the basis of logarithmic spherical waves 
proposed by Hamilton & Culhane. First. a(r), the logarithmic derivative of 
r'^n{r), equation (4.24), is not really constant, although it is slowly varying, 
and it is typically nearly constant in the nearer part of a redshift survey 
where the selection function is found empirically to approximate a power 

® Correctly, the distortion operator preserves angular symmetry to the extent that the 
selection function is independent of direction, so that the quantity a{r) in the distortion 
operator (4.23) is a function a(r) only of depth r, as is true in a survey that is uniformly 
fiux-limited within its angular boundaries. Over unsurveyed regions of the sky (where 
a{r) is zero divided by zero), the distortion operator can be taken to have the same 
form as over the surveyed part, as discussed in §3.2.1. To avoid confusion, it should 
be understood that it is the 'true' redshift power spectrum that possesses orientation 
symmetry. When the power spectrum is windowed through a non-all-sky survey, then the 
lack of angular symmetry of the window destroys the angular symmetry of the windowed 
redshift power spectrum. 
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law. The second problem is that it is not enough that the modes should be 
eigenfunctions of the distortion operator. It is also desired that the modes be 
statistically orthogonal, or nearly so. Now the modes would be statistically 
orthogonal if the power spectrum were, like the distortion operator, scale- 
free, i.e. a power law P{k) oc /c", which may not be a bad approximation 
in reality. However, in any real survey, Poisson sampling noise, §§3.1.2 and 
3.2.2, destroys the scale-free symmetry, except in the case that the power 
spectrum itself is that of Poisson noise, P{k) = constant. It remains to be 
seen if this idea can be exploited. 

The radial redshift distortion operator (4.23) is not Hermitian, unlike 
the plane-parallel distortion operator (4.33), which is Hermitian. Amongst 
other things, this means that the eigenfunctions of the radial distortion 
operator do not all have real eigenvalues. Presumably this means that red- 
shift distortions cause modes close to the observer to undergo a 'phase shift' 
in passing from real to redshift space. The Hermitian conjugate of the 
distortion operator is 

s. = i + ,v-V-^(|-5M),.. («o, 

Equation (4.40) is valid in the CMB frame. The Hermitian conjugate of 
the redshift distortion operator in the Local Group frame is given by equa- 
tion (4.50). 

Tegmark & Bromley (1995) derive an expression for the inverse S~^, the 
Green's function, of the distortion operator for the particular case a = 2, 
corresponding to a volume-limited sample. 

4.3. PECULIAR VELOCITY OF THE LOCAL GROUP 

The redshift distortion operator (4.23) is valid from the point of view of 
observers at rest with respect to the CMB. The Milky Way however is not 
stationary, but moving. The linear part of the motion of the Milky Way is 

the peculiar velocity of the Local Group (LG) of galaxies. The essential 
feature of the Local Group, which contains about 30 galaxies including our 
own, is that it is the local region of space, about 1 Mpc in radius, that has 
turned around from the general Hubble expansion of the Universe, and is 
beginning to fall together for the first time. 

In the LG frame, the redshift distance s^*^ of a galaxy at true position 
r relative to the observer is 

s^*^ = r + v- r.v^'-^ (4.41) 

which differs from the redshift distance s = r + v in the CMB frame by the 
component of —v^^ along the direction f to the galaxy. Prom conservation 
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of galaxies, n^'^ {s^'^) (P s^^ = n{s)(Ps, one concludes that, at distances 
much greater than the LG peculiar velocity, r S> v^'^, the redshift space 
overdcnsity 5'^^'^{r) observed in the LG frame is related to the redshift 
space overdensity 6^{r) in the CMB frame by 

^ LG 

J^LG (^r) = S'ir) + a{r) . (4.42) 

The difference term is a dipole directed along the LG motion v^^, modu- 
lated by the function a(r), equation (4.24). 

Since the LG shares in the linear motion of the nearby region, there is 
no dipole distortion of the nearby region in the LG frame. A dipole distor- 
tion appears in the CMB frame because of the streaming of galaxies past 
the stationary observer. Galaxies streaming toward the observer appear 
compressed to a smaller volume in redshift space by the spherical conver- 
gence of the volume element, and conversely galaxies downstream appear 
decompressed. 

Two general strategies to deal with the redshift distortion produced (or 
rather removed) by the motion of the LG are described below. A third 
strategy is to ignore the problem, which is what the plane-parallel approx- 
imation does. 

4.3.1. The Distortion Operator in the Local Group Frame 
The first strategy, used by Fisher, Scharf & Lahav (1994), is to work entirely 
in the LG frame, and to write the LG peculiar velocity as its value in linear 
theory, which is given by the usual formula (4.19) for peculiar velocity, 
evaluated at the position of the LG, the origin, r = 0: 

. (4.43) 

r=o 

An equivalent expression is 

which expresses the usual fact that peculiar velocity in linear theory is 
proportional to peculiar gravity, which is proportional to the integrated 
dipole overdensity about the observer. 

The value (4.43), inserted in equation (4.42), combined with the redshift 
distortion equation 6^ = SS in the CMB frame, equation (4.22), yields the 
distortion equation for the redshift overdensity S^^^ in the LG frame 



V 



LG 



^ dr 



(4.45) 



LINEAR REDSHIFT DISTORTIONS 



37 



where S^*^ is the hnear redshift distortion operator (4.23) modified to the 
LG frame 



1 + 13 



d"^ a{r)r 
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V" 



(4.46) 



The first d/dr in the a term is the gradient evaluated at the position r, 
while d/dr\^^Q is the gradient evaluated at the position of the LG. The 
assumption of linear theory here automatically guarantees that the peculiar 
velocities of galaxies in the LG frame are small compared to their distances, 



(4.47) 



unlike the corresponding condition (4.29) in the CMB frame, which had to 
be incorporated as an additional assumption. 

The redshift distortion operator S^^ in the LG frame, equation (4.46), 
can also be derived directly along the same lines as the derivation in §4.2 of 
the distortion operator S in the CMB frame, equation (4.23). The main dif- 
ference is to replace v ^ v — r.v^^ throughout the derivation; for example, 
equation (4.30) in the CMB frame changes, in the LG frame, to 



(4.48) 



which is Kaiser's (1987) equation (3.3). 

The redshift correlation function ^*^^(ri2, ri, r2) in the LG frame is 
related to the unredshifted correlation function C(ri2) by 



^^^(ri2,ri,r2) = SY'S\'' ^{r^2) 



(4.49) 



which may be compared to the corresponding equation (4.32) in the CMB 
frame. 

The Hermitian conjugate of the linear distortion operator S^*^ in the LG 
frame can be written in a variety of ways. One of the simpler ways, which 
however does not reveal the subtraction of the LG velocity as manifestly 
as does the expression (4.46) for the distortion operator S^*^ itself, is 



gLGf = 1 + ^ 



V-2r--2|- ( ^ 



dr 



d_ 
dr 



a(r) 



d_ 
dr 



V ^a(r)r 



r=o 



(4.50) 

in which the last term inside the square brackets on the right hand side 
is the term arising from the motion of the LG. This equation (4.50) may 
be compared to the corresponding equation (4.40) for the Hermitian con- 
jugate of the distortion operator in the CMB frame. Equation (4.50) is 
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used in equation (8.13), and the equation (8.14) following that one gives 
a rearrangement that brings out the subtraction of the LG velocity more 
clearly. 

4.3.2. Using the Known Value of the Local Group Motion 
The second strategy to deal with the motion of the LG is to use its known 
value, measured from the dipole anisotropy of the CMB radiation, corrected 
for the motion of the solar system about the centre of the Milky Way, and 
for the motion of the Milky Way within the Local Group (Yahil, Tammann 
& Sandage 1977). The LG pecuhar velocity is (Kogut et al. 1993, Table 3; 
Lineweaver et al. 1996) 

= 627 ± 22 km s'^ towards I = 276° ±3°, 6 = 30° ± 3° (4.51) 

in the constellation of Hydra. 

Thus if the rcdshift ovcrdcnsity 5'^^'^ is measured in the LG frame, 
then it can be corrected to yield the overdensity (5* in the CMB frame by 
subtracting a dipole term in accordance with equation (4.42), 

LG 

S'{r) = S'^^{r) - a(r)^^^^ . (4.52) 

If a{r) is a function only of depth r, independent of direction, then the 
adjustment is a pure dipole overdensity, but in the general case the adjust- 
ment involves other harmonics also. The redshift correlation function of the 
overdensity in the CMB frame is given by equation (4.32). 

An alternative version of the second strategy is to adjust the redshift 
distances from the LG frame to the CMB frame using equation (4.41), 
s = + r.v^^, and to measure the overdensity (5^ in the resulting CMB 
frame. However, this alternative version is not equivalent to the original 
when the condition \v\ <C r is violated. Closer examination of the derivation 
of the distortion operator S, equation (4.23), reveals that in effect it assumes 
that any streaming motion past the observer induces a dipole in the redshift 
overdensity. It follows that the best procedure is to measure the overdensity 
in the frame where the streaming motion vanishes, which is the LG frame, 
and then to transform to the CMB frame by making a dipole correction to 
the overdensity, as previously recommended, equation (4.52). 

If the overdensity 6^{r) in the CMB frame is constructed according to 
the recommended procedure — measure the overdensity in the LG frame 
and then correct to the CMB frame by subtracting a dipole streaming term, 
equation (4.52) — then the distortion equation (4.22) in the CMB frame 
will remain correct even without the condition \v\ <C r, equation (4.29). 

Which strategy is better for dealing with the motion of the LG, to 
work entirely in the LG frame as described in §4.3.1, or to make a dipole 
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correction and to work in the CMB frame, as proposed in this subsection? It 
seems to be largely a matter of judgement or convenience. The difference in 
information content is not great. If one likes, the difference can be isolated 
into the single mode with the specific form a{r)r.v^^ /r, equation (4.42), by 
making all other modes orthogonal to it (Tegmark et at. 1997). In the case 
of an all-sky survey with a uniform flux limit, the mode is a dipolc mode. 
For a non- all-sky survey with a uniform flux limit, the a{r)r.v^'^/r mode 
is effectively a 'cut' dipole mode, and making observed modes orthogonal 
to this mode means making them orthogonal to the cut dipole. 

Note that the peculiar velocity of the LG in the second strategy is not 
used in order to allow a measurement of (3 from any direct comparison 
between peculiar velocity and density fields; rather the LG velocity is used 
to introduce a dipole redshift distortion, which combines statistically with 
all the other redshift distortions to yield a statistical estimate of (3. 

4.3.3. Rocket Effect 

The fact that the peculiar motion of the observer induces a dipole overden- 
sity in redshift space in the direction of motion leads to an effect called the 
rocket effect, first pointed out by Kaiser (1987, §2; Kaiser k, Lahav 1988, 
p. 366). This dipole needs to be distinguished carefully from the true dipole 
anisotropy of the density around the observer, which in the standard gravi- 
tational instability picture is responsible for inducing the observer's peculiar 
velocity, equation (4.44) (Strauss et al. 1992b). 

Consider for example an observer situated in a perfectly uniform dis- 
tribution of galaxies, all with zero peculiar velocities. Suppose that the 
observer hypothesizes that he is actually moving at some velocity v rela- 
tive to the background. This motion will induce a spurious dipole in the 
direction of motion, which the witless observer may interpret as the dipole 
pulling him. 

A complementary problem arises if an observer attempts to correct the 
dipole around him for the reflex motion induced on the observer by that 
dipole. Here there is a tendency to instability at large depths, beyond the 
median depth of the survey, where a(r) goes negative (Nusser & Davis 
1994). 

4.4. REAL AND REDSHIFT SPACE SELECTION FUNCTIONS 

The distortion equation (4.22) with the distortion operator S given by (4.23) 
provides a valid prediction 5*(r) of the observed redshift overdensity ^^^^^{r) 
provided that the redshift overdensity is defined relative to the real space 
selection function n(r), as in equation (3.17). In the practical case however, 
the selection function is likely to be estimated in redshift space. This sub- 
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section examines how the real and redshift space selection functions n(r) 
and n*(r) are related for the typical case of a flux- limited redshift survey. 

Suppose that the redshift survey includes all galaxies brighter than some 
flux limit -Fiim, and suppose for simplicity that this limit is uniform over the 
angular extent of the survey. It is convenient to characterize the luminosity 
of a galaxy of flux F and true distance r by the maximum distance to 
which the galaxy could be moved and still remain above the flux limit: 

TL = r -— . (4.53) 



The limiting distance ri of a galaxy is just a measure of its luminosity 
L, since L = Anr^F = 47rr|^Fiii„. Define ^(ri)dlnrx, to be the luminosity 
function, the mean number density of galaxies in a logarithmic interval 
dlnr^ of limiting distances (which is the same, up to a factor, as the mean 
number density of galaxies in a logarithmic interval of luminosity, or in an 
interval of absolute magnitude). The selection function n(r) at (true, not 
redshift) depth r in this flux-limited redshift survey is the number density 
of galaxies luminous enough to be seen to depth r, which is the luminosity 
function integrated over tl > r 

/oo 
(f){rL) dlnvL ■ (4.54) 

All modern methods of measuring the selection function (Binggeli, Sand- 
age & Tammann 1988; Koranyi Sz Strauss 1997) seek to eliminate the 
(otherwise dominant) uncertainty arising from density inhomogeneity by 
assuming that the luminosity function ^(r/,) is a universal function, in- 
dependent of the local density, and estimating the luminosity function by 
counting relative numbers of bright and faint galaxies in identical volumes. 
If bright and faint galaxies at any point share the same peculiar velocity, as 
seems probable in the linear regime, then the measurement of the selection 
function in redshift space should remain unbiased by inhomogeneity. A sys- 
tematic difference between the real and redshift space luminosity functions, 
hence selection functions, does however result from the fact that radial pe- 
culiar velocities v shift galaxies not only in space, s = r + v, but also in 
luminosity, since the apparent limiting distance sl of a galaxy in redshift 
space, defined by 

^'■''> 

is s/r times its true limiting distance r^, equation (4.53). 

Hamilton & Culhane (1996) claim that the real and redshift selection 
functions agree to linear order, but this is false. What is true is that the ex- 
pectation values of the real and redshift selection functions, averaged over 
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an ensemble of observers, agree to linear order, essentially because the pe- 
culiar velocity averaged over an ensemble of observers is zero every^vhere, 
{v{r)) = (a difference between cnscmblc-averagcd real and rcdshift selec- 
tion functions arises at next order, since the ensemble average dispersion 
of peculiar velocities is non-zero) . In the actual case, however, the peculiar 
velocity does not vanish, and the actual real and redshift selections differ 
to linear order. 

4.4.1. Turner's Method 

As a typical procedure, consider Turner's (1979) procedure for estimating 
the selection function. It can be shown that Turner's method, when applied 
over a grid of depths r, yields the same selection function as that of the 
"Stepwise Maximum Likelihood" method of Efstathiou, Ellis &: Peterson 
(1988), although Turner's method does not yield uncertainties. Turner's 
procedure is to construct the selection function n(r) from 

din n(r) dN(rL,r) 
dlnr N{r,r)dlnrL 

where N{rL,r) is the observed number of galaxies brighter than ri and at 
the same time closer than r. In practice, equation (4.56) is solved on a grid 
of depths r, in which case the equation becomes, for bins of logarithmic 
width A Inr, 

A In n(r) _ Nire^^""^ ,r) - N{r,r) 
A Inr ~ N{r,r) ' \ ■ ) 

As pointed out by Strauss, Yahil & Davis (1991), if observed fluxes are 
rounded into discrete bins (for example, Zwicky magnitudes are rounded 
mostly to 0.1 magnitudes), then the grid in r must be chosen carefully to 
avoid a discretization bias. For a logarithmic spacing A In F of discretized 
fluxes, the estimate of the selection function is unbiased if the logarithmic 
spacing A Inr of depths is taken to be an integral multiple of Aln(F^/^); 
the estimate is most biased if A Inr is taken to be half of Aln(Fi/2)_ ^s 
with all such methods. Turner's procedure leaves the overall normalization 
of the selection function undetermined. The normalization must be fixed as 
a separate step. 

Suppose that Turner's method (4.56) is applied in redshift space, so 
that the redshift space selection function fi.*(s) is estimated from 

d\ns N^{s,s) dlnsL 

where N^{sl,s) is the observed number of galaxies brighter than sl and 
closer than s in redshift space. How is the redshift space selection function 



(4.56) 



(4.58) 
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estimated by (4.58) related to the real space selection function? Let /{tl, r) 
denote the number density of galaxies in an interval din tl d^r of luminosity 
and position in real space, and similarly let /*(si,,s) denote the number 
density of galaxies in an interval dlnsLd^s in redshift space. Conservation 
of galaxies implies 

f''{sL,s)dlnsLd^s = f{rL,r)dlnrLd^r (4.59) 

with s = r + V and sl = r^s/r, which generalizes equation (4.25). The 
relation s = r + v is valid in the CMB frame; in the LG frame, replace 
V ^ V — r.v^^. The assumption that the luminosity function 4'iTL) is 
a universal function, independent of position, implies that the real space 
number density f{rL,r) factorizes to 

f{rL,r) = <P{rL)[l + d{r)] . (4.60) 

Given this assumption (4.60), a derivation similar to that leading from (4.25) 
to (4.30) shows that, to linear order in the overdensity S{r) and peculiar 
velocity v{r), equation (4.59) implies 



firL,r) = (j){rL) 



(4.61) 



where the dimensionless quantity a0(ri) = dln[r'j^(p{rL)]/dlnrL is similar 
but not identical to the quantity a{r) defined by equation (4.24). By def- 
inition, the number N^{sl,s) of galaxies brighter than sl and closer than 
s is 

/•S /• POO 

N'{sL,s)= ris'L,s')dlns'Ld^s' (4.62) 

JoJaJsl 

where the angular part of the spatial integral is over the solid angle A of 
the survey. A few-step calculation using equation (4.61) in equation (4.62) 

leads to 

ON^jri-r) _ dlnn{rL) dHnnjr^) /v\ 
N^{rL,r) dlnvL dlnri dlnr^ \r 

where n{rL), the real space selection function, is the integral (4.54) of the 
luminosity function, and {v/r)r is the volume-average of v/r within depth 
r over the solid angle A of the survey 

'v\ _ 3 v{r') 
, r L Ar^ 



77 ^d3/. (4.64) 



Equation (4.63) shows that the basic relation between the real and redshift 

space selection functions, when the latter is estimated according to Turner's 
procedure (4.58), is, to linear order in the overdensity and peculiar velocity, 

dln[n(r)/n*(r)] d^lnn{r) /v\ (4 65) 



dlnr dlnr^ \r 
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Integrating equation (4.65) yields 



n r 



, .^y. , . , '■m'l'' In n(r') /?;\ dr' 

l^J = " X ~dh[?^ Vrl - ("-'^^ 

where the limit rmax of integration is an arbitrary depth which can conve- 
niently be taken to be, say, the maximum depth of the survey. Changing 
^max changes ln[rl(r)/n*(r)] by an integration constant, the arbitrariness of 
which reflects the fact that Turner's method leaves the overall normalization 
of the selection function undetermined. Changing the order of integration 
transforms equation (4.66) to 

(^) = - in '^f"^^^^' ^'^^ ^ '''' ^'-''^ 

where the integration is over the volume of the survey, and the function 
a(r) is 

3 r^-^ 1 d^\nn(r') dr' 

In practice it would normally be better to estimate the redshift selec- 
tion function n^^^[r) in the LG frame rather than in the CMB frame, 
since the derivation of equation (4.61) requires amongst other things that 
\v\ <^ r, which is not necessarily satisfied at small depths r, whereas the 
corresponding condition \v — f .t>^^| ^ r in the LG frame is automatically 
satisfied, at least over linear scales. Equations in the LG frame are obtained 
by replacing v ^ v — r.v^^ everywhere. In the LG frame, equations (4.66) 
and (4.67) become 

/ n(r) \ /■''max In n(r') f/v\ /r\ lg\ 

U^L^(r)y ^ ~Jr ~dh[?^[\rl>~\rlr'" )V 

a[max(r, /)] [^(^0 ~ ^ J ^3^/ ^4 ^9^ 



Jo Ja 



where {r/r)r = 3/{Ar^) JoJAi'f'' /''^')d^f^' is defined to be the volume average 
of f/r within depth r over the solid angle A of the survey, similarly to the 
definition (4.64) of {v/r)r- Note that {'r/r)r = in the case of an all-sky 
survey. 

Just as there were two strategies to deal with the LG motion, described 
in §§4.3.1 and 4.3.2, so here there are two general strategies to deal with the 
difference between real and redshift selection functions. The two strategies 
are described in the next two subsections, §§4.4.2 and 4.4.3. 
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4.4.2. The Distortion Operator Relative to the Redshift Selection Function 
The first strategy is to define the redshift overdensity 6^^{s) relative to 
the selection function n*(s) in redshift space, as in equation (3.18), and to 
modify the linear redshift distortion operator S accordingly. 

It is straightforward to ascertain that the overdensities (5**(r) and 5^{r) 
defined respectively relative to the redshift and real space selection func- 
tions, equations (3.18) and (3.17), are related to linear order by 

6''{r) = 6%r) + ln[n{r)/n%r)] . (4.70) 

Equation (4.67), together with the usual linear expression (4.31) for the 
radial peculiar velocity v, shows that, in the case of Turner's (1979) method 
applied to a flux-limited redshift survey, ln[n(r)/n*(r)] can be written 

ln[n{r)/n'{r)] = j S^{r,r')5{r') Sr' (4.71) 

(more compactly, the right hand side of eq. [4.71] is just S"(5 in the notation 
of matrices and vectors in Hilbert space, §3.3) where the linear operator S" 
is 

S^(r,r') = /3a[max(r,r')]w(r')^V'-2 (4.72) 

r or 

with uj{r') = 1 inside, and outside, the volume of the survey. 

Thus the linear redshift distortion equation for the redshift overdensity 
S'^ is 

S'' = S'5 (4.73) 

where the linear distortion operator is the sum of the linear distor- 
tion operator S, equation (4.23), and an extra piece S" given, for Turner's 
method, by equation (4.72): 

= S + S" . (4.74) 
A complete expression for the distortion operator S'' is 

S^(r, r') = 6n{r - r') [l + /3 |^V-) + /? ^^^^7 V'- (4.75) 

where a*(r,r') is 

a^{r, r') = SDir — r')a(r) + a[max(r, r')](jj{r') (4.76) 

with a(r) and a(r) given by (4.24) and (4.68), and again u>{r') = 1 inside, 
and outside, the volume of the survey. 
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In practice, it makes more sense to apply this strategy in the LG frame 
than in the CMB frame, for the reason given in the paragraph containing 
equation (4.69). In the LG frame, equations (4.70)-(4.75) carry through, 
but superscripted LG appropriately. In the LG frame, the overdensities 
gssLG g^^^ gsLG related by 



psLG(^^ = S''^^{r) + ln[n(r)/n^L^(r)] . (4.77) 

In the case of Turner's method applied to a flux-limited survey, equa- 
tion (4.69) shows that ln[n(r)/n*^^(r)] = / S''^'^{r,r')6{r') (fr' where the 
operator S"'^*^ is 



jnLG/ 



'{r,r')=P- 



(-- 


d 




\dr' 


dr' 


r'=o) 



V'-^ . (4.78) 



The distortion equation for the redshift overdensity 5^^ in the LG frame 
is 

^^sLG^gsLG^ (4.79) 

where the linear distortion operator S*^*^ is the sum of the distortion op- 
erator S^*^, equation (4.46), and the extra piece S"'"*^ given, for Turner's 
method, by equation (4.78): 

gsLG^gLG^gnLG _ (4 30^ 

A complete expression for the distortion operator S*^*-' is 



+ P 



Qj.2 

a^{r,r')r' 



(-- 


d 




\dr' 


dr' 


r'=o) 



V'-2 (4.81) 



with a^{r, r') given by equation (4.76). In evaluating Q;*(r, r'), the selection 

function n(r) which enters the definitions (4.24) of a{r) and (4.68) of a{r) 
can be replaced to linear order by the measured redshift selection function 

Equation (4.81) is an important equation. To recapitulate, it gives the 
form of the linear redshift distortion operator S'^^'^ for the case of a flux- 
limited redshift survey in which the redshift space overdensity (5**lG|-^^ 
defined (a) in the frame of reference of the LG, and (b) relative to the 
redshift space selection function n*'"*^(r) measured using Turner's (1979) 
method, equation (4.58), again in the LG frame. That is, the distortion 
equation (4.79) gives the theoretically predicted form of the overdensity 
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^-ssLG^^-j in the linear regime, to be compared to the observed redshift 
over density S^y^l'^(r) defined by 



with n^^^(r) the observed galaxy density at redshift position r in the LG 
frame, and n^^^{r) the selection function measured in redshift space in the 
LG frame by Turner's method (4.58). 

The redshift correlation function ^^^{ri2,ri,r2) of the overdensity S^^{r) 
in the linear regime is related to the unredshifted correlation function ^(ri2) 

by 

r(ri2,ri,r2) = SfS|e(ri2) ■ (4.83) 

Similarly the redshift correlation function ^**^^(ri2, ri, r2) of the overden- 
sity S^^^^(r) in the LG frame is 

r'^''(n2,ri,r2) = Sf^Gg^LG^^^^^) _ (4_84) 



4.4.3. Correcting the Redshift to the Real Space Selection, Function 
The second strategy is to measure the redshift space selection function 
n*(r) as before, but then to correct it to the real space selection function 
n(r). 

The correction is given to linear order by equation (4.66), for the case of 
a flux-limited redshift survey in which the redshift space selection function 
is estimated by Turner's (1979) method, equation (4.58). Actually it would 
normally be preferable to estimate the redshift selection function n*^*^(r) 
in the LG frame, and to correct this to the real selection function using 
equation (4.69). More precisely, the correction is 

y f H^) \_ r-^'^^ d?\nn''^'^{r') nv\ /r\ ^lq\ dr' 
\n^^'^{r)J Jr (ilnr'2 \\r \r /^i' J r' 

(4.85) 

which is the same as equation (4.69) except that the redshift selection 
function n^^^{r') (the thing you have) replaces the real selection function 
n{r') (the thing you want) in the integrand on the right hand side of equa- 
tion (4.85). The replacement is valid to linear order because n*'"*^(r) and 
n(r) agree to zeroth order. 

The correction (4.85) involves modelling the volume-average {v/r)r, 
equation (4.64), of the radial peculiar velocity v divided by depth r, as 
a function of depth r in the survey. Note that {v/r)r is an average of the 
actual v/r, not an ensemble average. One way to do this is to carry out 
a complete reconstruction of the unredshifted density and velocity field 



LINEAR REDSHIFT DISTORTIONS 



47 



(Yahil et al. 1991; Fisher et al. 1995b; Webster, Lahav & Fisher 1997). 
Complete reconstruction seems extravagant merely for the purpose of esti- 
mating {v/r)r; perhaps a simpler approximation could be devised. 

It is to be noted that in an all-sky survey {v/r)r is negative if galaxies 
are on average converging towards the observer, which occurs if the observer 
is in an over dense region of the Universe. The LG lies in such an overdense 
region, as is typical for galaxy-bound observers, which is an example of 
local bias. 

In a non-all-sky survey, {v/r)r depends in addition on the alignment of 
the survey with large scale flows. 

However carefully {v/r)r is modelled, there is liable to be some un- 
certainty in it, hence some uncertainty in the correction ln[n{r)/n^^^{r)], 
equation (4.85). The correction ln[n(r)/n*^*^(r)] to the selection function 
is essentially a correction to the redshift overdensity, as in equation (4.77). 
Measurements of the redshift space overdensity can be immunized against 
uncertainty in the radial correction by discarding from the statistical anal- 
ysis the uncertain radial mode or modes — possibly all radial modes, to 
be conservative — and by making all other observed modes orthogonal to 
these radial modes (Tcgmark et al. 1997). In the case of an all-sky survey 
with a uniform flux limit, radial modes are monopole modes about the ob- 
server. For a non-all-sky survey with a uniform flux limit, radial modes are 
effectively 'cut' monopole modes, and observed modes can be immunized 
against radial uncertainty by making them orthogonal to the cut monopole 
modes. 

5. Methods to Measure /? 

To date, essentially three types of method have been used to measure /? 
from linear redshift distortions: 

1. The ratio of angle-averaged redshift space to real space power spectrum 
or correlation function; 

2. The ratio of quadrupole to monopole moments of the redshift space 
power spectrum; 

3. A maximum likelihood approach in which the data are the amplitudes 

of (many hundreds of) individual modes, and the parameters are /? 
and the power spectrum P{k), parametrized in some way. 

These methods are discussed in turn below. 
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5.1. RATIO OF REDSHIFT SPACE TO REAL SPACE ANGLE- AVERAGED 
POWER SPECTRA 

This method was proposed in Kaiser's (1987) original paper. It assumes the 
plane-parallel, or distant observer, approximation, where structures are suf- 
ficiently far away that line-of-sight peculiar velocities are effectively plane- 
parallel, as illustrated in the left panel of Figure 1. 

Kaiser's formula (4.36) predicts that in the linear regime the angle- 
averaged redshift power spectrum P^{k) = J P^{k)doj^/{4Tr) [which is the 
same thing as the monopole redshift power -Pq (fe) discussed in §5.2] is am- 
plified by a constant factor over the unredshifted power spectrum P{k): 

P{k) 3 5 ^ ^ 

As (3 varies between and I, the amplification factor varies between 1 and 
28/15, almost a factor of 2. 

Equation (5.1) is valid for any wavenumber k in the linear regime, so 
remains valid if the real and redshift space power spectra are first smoothed 
over any window W{k) that picks out linear wavenumbers. That is, if 

POD 

P{k)= W{k)P{k)A-Kk^dk/{2T:f (5.2) 
io 

and similarly for P^{k), then the ratio of the smoothed redshift to real 
space power spectra is predicted to be 

ps 2 1 

— = 1 + 3/?+ 5/3'- (5.3) 

In particular, the ratio of the angle-averaged redshift space correlation func- 
tion ^^(r) = / ^*(r)(io7-/(47r) [which is the monopole redshift correlation 
function ^o('')] to its unredshifted counterpart ^(r) is, at linear separations 
r, 

e(r) ^3'^^5'^ ' ^ ' 

which follows from equation (5.3) with the smoothing window in (5.2) taken 
to be the zeroth spherical Bessel function, W{k) = J e~^^-''^ dof^/{4-ir) = 
joikr). 

The unredshifted correlation function ^ (r) and/or the unredshifted power 
spectrum P{k) can be measured by deprojecting the angular correlation 
function of a survey. This deprojection is liable to be quite noisy, unless the 
survey is large. Thus the method of inferring /3 from the ratio of redshift to 
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real correlation functions or power spectra is most appropriate if the red- 
shift survey happens to be a subset of a larger angular survey. Examples of 
this are the Stromlo-APM survey (Loveday et al. 1996b), which is a l-in-20 
redshift survey of galaxies brighter than hj = 17.15 in the APM angular 
survey, or the QDOT survey (e.g. Saunders et al. 1990; Rowan-Robinson 
et al. 1991; Saunders, Rowan-Robinson & Lawrence 1992; Lawrence et al. 
1997, in preparation) which is a l-in-6 redshift survey of galaxies brighter 
than 0.6 Jy from the IRAS Point Source Catalog. 

5.2. RATIO OF QUADRUPOLE TO MONOPOLE HARMONICS OF THE 

POWER SPECTRUM 

The idea of decomposing the redshift correlation function into harmon- 
ics was proposed by Hamilton (1992), and the connection to the power 
spectrum was clarified by Cole, Fisher Sz Weinberg (1994). Integral expres- 
sions for the shape of the redshift correlation function had previously been 
given by Kaiser (1987), Lilje k Efstathiou (1989), and McGiU (1990). This 
method, like method 1, assumes the plane-parallel approximation. 

For plane-parallel redshift distortions, the redshift power spectrum can 
be written as a sum of even harmonics -P/(A;) 

P%k) = J2 'Peil^k)Piik) , P/(fe) = (2^ + 1) J Veitik)P%k)doj,/47r 

£ even 

(5.5) 

where Veifif^) are Legendre polynomials, and //y^ = z.k is the cosine of the 
angle between the wavevector k and the line of sight z. The odd harmon- 
ics vanish by pair exchange symmetry, and non-zero azimuthal harmonics 
(corresponding to Yi^s with m 0) vanish by symmetry about the line of 
sight. 

In the linear regime, Kaiser's formula (4.36) shows that the redshift 
power spectrum reduces to a sum of monopole (^ = 0) , quadrupole {i = 2), 
and hexadecapole {i = 4) harmonics 

P^k) = Vo{fik)Po{k) + r2{fik)Pm + r4{fik)Piik) (5.6) 

where the harmonics P| of the redshift space power spectrum are related 
to the unredshifted power spectrum P{k) by 

Poik) = (l + ^/3 + ^/32)p(fc) 
Piik) = {^^+^P')Pik) 

P!{k) = ^i3'P{k). (5.7) 
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Monopole Quadrupole Hexadecapole 



Figure 6. Shapes of the monopole (£ = 0), quadrupole (£ = 2), and hexadecapole {I = 4) 
harmonics (thick lines) , drawn as perturbations of a sphere (thin lines) . The quadrupole 
has positive amplitude as drawn, as appropriate for the redshift power spectrum. The 
factor of between the power spectrum and the correlation function, equation (5.10), 
introduces a minus sign in the quadrupole correlation function, which therefore appears 
squashed, opposite to what is drawn. 



Equations (5.7) imply in particular that the ratio P2{k) / PQ{k) of quad- 
rupole to monopole harmonics of the redshift power spectrum in the linear 
regime is 

P|(fc)_ 1/3+1/3^ 

which therefore provides a means of measuring /?. Equations (5.7) predict 
that the hexadecapole harmonic Pl{k) should be quite small, ^ 0.1 of the 
monopole and quadrupole harmonics for /3 < 1, and it also proves to be 
noisier, especially at the largest scales, so in practice the quadrupole-to- 
monopole ratio has been the statistic of choice for many authors. 

The harmonics ^|(r) of the redshift correlation function are defined 
similarly to those (5.5) of the power spectrum 

f (r) = T'eMCeir) , ^l(r) = {21 + 1) J VeMCir) dor/iAn) 

t even 

(5.9) 

where nr = z.r is the cosine of the angle between the separation vector r 
and the line of sight z. The harmonics P^^k) of the redshift power spcctrTim 
are related to the harmonics ^l{r) of the redshift correlation function by 



P/(fc) = i*47r / jt{,kr)it{r) r'dr , 4|(r) = r'^TT / jiikr)Plik) k'dk/{27TY 
Jo Jo 

(5.10) 

where je{kr) are spherical Bessel functions. For the quadrupole harmonic, 
£ = 2, the factor in equation (5.10) introduces a negative sign between the 
power spectrum and the correlation function. Thus the large scale squash- 
ing effect of the redshift correlation function, a negative quadrupole, cor- 
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responds to a stretching of the redshift power spectrum along the hne of 
sight, a positive quadrupole (Cole, Fisher k. Weinberg 1994, Fig. 1). 

5.2.1. Smoothed Harmonics of the Redshift Power Spectrum 

In any finite survey, it is necessary to measure the power spectrum at finite 
resolution, given roughly by the inverse scale size of the survey. Consider 
then the harmonics of the redshift power spectrum smoothed over some 
window W{k) (the following equation generalizes eq. [5.2]) 

P/=/ W{k)PI{k)AiTk'^dk/{2'Kf . (5.11) 

JO 

As long as the window W{k) picks out wavenumbers in the linear regime, 
the quadrupole-to-monopole ratio of the smoothed harmonics will also 
satisfy the relation (5.8), 

The relation (5.10) between the harmonics of the power spectrum and the 
correlation function means that the smoothed harmonics P/ of the power 
spectrum, equation (5.11), can also be written as smoothed harmonics of 
the correlation function: 

/■oo 

P!= / W(,{r)it{r)ATTr-'dr (5.13) 
Jo 

where the real space smoothing window Wi{r) is related to the Fourier 
window W{k) by 

/•oo 

We{r)=i^4Tr je{kr)W{k)k'^dk/{2Trf . (5.14) 
Jo 

The point of equation (5.13) is that it shows how smoothed redshift power 
spectra P/ can be measured from quantities in real (redshift) space, which 
is where the data lie. 

5.2.2. Smoothing Windows 

Childers (1978, p. 1) terms the business of choosing smoothing windows 
W{k) "window carpentry". One possibility, proposed by Hamilton (1995) 
and used in §6 below, is to use smoothing windows that are power laws 
times a Gaussian, W{k) k" c:^^ with n an even integer, suitably scaled 
and normalized. These windows W{k) have the desirable properties that: 

o they are everywhere positive, so preserving the intrinsic positivity of 
the power spectrum; 
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o they vanish at zero wavenumber, W{k) = at A: = 0, provided that 
n > 2, so immunizing the measurement of power against uncertainty in 
the mean density (which makes a delta-function contribution to power 
at zero wavenumber); 

o they are analyticahy convenient; 

o they yield Gaussian convergence as a function of pair separation r 
in the corresponding real space windows We{r), equation (5.14), for 
harmonics i < n, provided that n is chosen to be an even integer. 

Suitably scaled, and normalized so / W{k)d^k/{2Tr)^ = 1, these smoothing 
windows are 

d^k _ 2e-i\-+^dq _ak 
^^^^Wf = r[(n+3)/2] ' ^ = T- ^'-''^ 

The wavenumber k is an 'effective' wavenumber, the constant a being cho- 
sen so that W(k) peaks at approximately k ^ k.ln Hamilton (1995) and §6, 
the constant a is chosen so that the smoothed monopole power at wavenum- 
ber k is equal to the unsmoothed monopole power at the same wavenumber, 
Pq (^) = -^0 (^)) for the particular case where the power spectrum is a power 
law of index —1.5, that is, for Pq (A;) oc k~^-^. For the case n = 2 in the win- 
dow (5.15), as used in Figure 7, this fixes a = 1.279. The harmonics of the 
power spectrum smoothed over the window (5.15) are, according to equa- 
tion (5.13), equal to the harmonics of the correlation function smoothed 
over corresponding windows We{r) given by equation (5.14): 

_ i'[{n-i)/2]\ , 2 ,+ (1/2) . 2^ .5 

(3/2)(„/2) %-i)/2^')' '-2a ^^-^^^ 

(note that VFo(O) = 1) where are Laguerre polynomials (Abramowitz &l 
Stegun 1964) and (3/2)(„/2) = r[(n+3)/2]/r(3/2) is a Pochhammer sym- 
bol. The smoothing windows (5.16) happen to be e~* times the radial 
wavefunctions of a 3-dimensional simple harmonic oscillator at energy level 
n and angular momentum (. (Landau &l Lifshitz 1958, §33 Problem 4). The 
quantization condition that the index {n — t)/2 be a non-negative integer 
ensures Gaussian convergence at large separations r. Despite the forbid- 
ding appearance, the smoothing windows Wi{r) are just polynomials times 
a Gaussian, with a rapid stable recurrence relation available to evaluate the 
polynomials. 

It is noteworthy that in order to measure the tth harmonic with one 
of the windows (5.15), it is necessary to choose n > £, so that the window 
must go to zero at least as fast as W{k) ^ k^ as k ^ 0. 
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5.3. MAXIMUM LIKELIHOOD 

Maximum likelihood (ML) techniques offer an optimal way to estimate /3 
and its uncertainty from redshift distortions. The technique is not only 
optimal, yielding both the most probable answer along with a complete 
probability distribution of answers, for a given prior, but is flexible enough 
to admit complications, such as the radial (as opposed to plane-parallel) 
character of redshift distortions, which can be difficult to deal with by direct 
methods. 

The ML technique was first applied to measure ^ from linear redshift 
distortions by Fisher, Scharf & Lahav (1994), and a more thorough imple- 
mentation was carried through by Heavens & Taylor (1995) and then by 
Ballinger, Heavens & Taylor (1995). 

Likelihood techniques, applied according to Bayesian precepts (Loredo 
1990), require a prior, that is, a precise statement of prior assumptions, 
including the range of models to be considered, and the prior probability 
distributions of the parameters to be measured. In the usual case, the prior 
probability of the parameters is taken to be uniform, in which case the 
posterior probability, the probability distribution of the parameters given 
the observed data (which is what you want), is proportional to the likelihood 
function, which is the probability of the observed data given the parameters 
(which is what theory tells you). 

When estimating (3 from linear redshift distortions, the prior is liable 
to include such assumptions as that the Universe is statistically homoge- 
neous and isotropic, that the distribution of over density on linear scales is 
a multivariate Gaussian, that the power spectrum takes such and such a 
parametrized form (e.g. Cold Dark Matter (CDM) or one of its cousins), 
that observations introduce such and such additional sources of uncertainty 
(notably Poisson sampling noise), that linear redshift distortions conform 
to the standard model described in §4, and that all values of /3 are a priori 
equally likely (which is called a uniform prior on the parameter (3). 

At first sight the need to state a prior seems a disadvantage compared 
to traditional direct approaches (where for example one measures j3 from 

some ratio of power spectra, and one measures power spectra by, well, 
by measuring power spectra). With second sight however, one realizes the 
advantages of the ML procedure: 

• Being forced to state one's prior assumptions explicitly is a virtue, not 
a drawback; 

• If changing your prior (within plausible limits) makes much difference 
to the values of the quantities you are trying to estimate, then you are 
not learning much from the data; 
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• The mathematical formahsm is quite general, specifying the best way 
to proceed even in highly complicated situations. 

This having been said, it is true that the ML formalism will happily 
provide answers for parameters even if the prior is completely wrong. Thus 
the prior should, correctly, always be tested for consistency with the data 
(a step that to date has generally not been taken). 

If the real overdensity S forms a multivariate Gaussian field, as may 
well be true on linear scales in reality, then the observed linear redshift 
overdensity ^^^s is also predicted to be a multivariate Gaussian (any linear 
combination of a Gaussian field is a Gaussian field), and the likelihood 
function £ is Gaussian 

where = (5* 5*^) is the expectation value of the survey covariance ma- 
trix, and |C^| and C^~^ are its determinant and inverse. The S^y^^ in the 
likelihood (5.17) is the observed vector of overdensities, while the covariance 
matrix is the prior. The likelihood function (5.17) gives the probability 
of the observed data, 6^^,^, given the prior, C. With a uniform prior on the 
parameters of the covariance matrix C*, the likelihood function becomes 
(up to a normalization factor) the probability of the parameters given the 
data. 

In general, the prior covariance is a combination of signal plus noise 
terms. In the simplest case, one supposes that the only source of noise in 
a redshift survey of galaxies is Poisson samphng noise (§§3.1.2, 3.2.2). The 
survey covariance C* is then a sum of cosmic and Poisson sampling terms 
(the following equation is a repeat of eq. [3.28]) 

Cfj = e{ri,rj)+6n{ri - r,)[n(r,)]-^ (5.18) 

where ^^(ri,rj) is the redshift space correlation function, and dnifi — rj) 
is a Dirac delta function. For fluctuations in the linear regime, the redshift 
correlation function ^^{ri,rj) is predicted to depend on the linear distor- 
tion parameter P and the unredshifted correlation function ^(r) (or equiva- 
lently the unredshifted power spectrum P{k), eq. [3.6]) in accordance with 
equation (4.32). The parameters to be measured are then the distortion pa- 
rameter P, and the parameters of some parametrization of the unredshifted 
power spectrum P{k). 

Actually, the 'observed' redshift overdensity S^^g{r), equation (3.17), 
depends not only on the observed galaxy density n*(r) in redshift space, 
but also on the selection function n(r), which is itself estimated from the 
survey. The shape of the selection function (or the parameters of some 
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parametrization thereof) would normally be measured as a separate op- 
eration, since that measurement typically involves additional assumptions 
about the universality of the luminosity function (see §4.4). This leaves the 
overall normalization of the selection function as an extra parameter to be 
measured from the likelihood function (5.17). In principle, uncertainty in 
the measurement of the shape of the selection function shoTild be allowed 
for by marginalizing over it, that is, by integrating the likelihood function 
over the probability distribution of the parameters of the shape of the se- 
lection function. In practice, the shape of the selection function is normally 
so accurately measured, compared to other uncertainties, that it can be 
treated as a fixed, known quantity in the likelihood analysis. 

In order to proceed numerically, it is necessary to pixelize the data in 
some fashion, so that the covariance matrix Cij becomes a finite matrix. 
In principle, the choice of pixelization is irrelevant, if the pixels are fine 
and many enough to represent the survey data accurately. In practice, the 
matrix Cij needs to be of tractable size, and some cunning is needed to pick 
a pixelization that is small but nevertheless contains virtually all the infor- 
mation about the parameters to be measured (Tegmark, Taylor & Heavens 
1996; Tegmark et al. 1997). Fisher, Scharf & Lahav (1994) chose the pixels 
to be spherical harmonics to ^ < 10 on the sky, with 4 Gaussian pixels in the 
radial direction. Heavens & Taylor (1995) and Ballinger, Heavens &; Taylor 
(1995) chose a basis of spherical waves, that is, spherical harmonics on the 
sky, with (weighted) spherical Bessel functions in the radial direction. 

Clearly the ML technique, or methods essentially equivalent to it, is the 
method of choice for measuring redshift distortions, certainly on the largest 
scales where a small number of pixels contains all relevant information, per- 
haps also on smaller scales if clever compression techniques can be applied 
(Vogeley Sz Szalay 1996; Tegmark, Taylor &; Heavens 1996; Tegmark et 
al. 1997). Undoubtedly there will be further development of this powerful 
approach in the future. 

One mysterious aspect of ML results published to date is that they yield 
estimates of /3 that appear to be systematically larger than other methods 
(see Tabic 8.1). One possible reason for the difference is the fact that the 
plane-parallel approximation underestimates p. However, authors who use 
the plane-parallel approximation have generally confined their measure- 
ments to opening angles no larger than 50°, at which point the A'^-body 
simulations of Cole, Fisher & Weinberg (1994, Fig. 8) indicate that the 
plane-parallel approximation underestimates f5 by only 5%. Another possi- 
bility, suggested by Ballinger, Heavens & Taylor (1995), is that direct meth- 
ods unfairly penalize larger values of (3, since larger values of f3 predict a 
larger variance in the power spectrum, which is not taken into consideration 
by direct methods. 
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The resolution of this mystery remains unclear. 

6. Example of Measuring P from Linear Redshift Distortions 

It is helpful to present an example of the measurement of P from linear 
redshift distortions, which is intended not only to illustrate how things 
work out in practice, but also (a) to bring out the difference between IRAS 
and optically selected galaxies when they are analysed with (essentially) the 
same procedure, and (b) to demonstrate the importance of nonlinearities 
(fingers-of-god) . 

The results are given first, in §6.1, and then §6.2 provides some details 
of the analysis. 

6.1. RESULTS 

Figure 7 shows the ratio (A;)/Pq (^) of smoothed quadrupole to smoothed 
monopole power measured as a function of effective wavenumber k in the 
IRAS QDOT + 1.2 Jy survey and the Stromlo-APM survey. These are the 
same surveys whose redshift correlation functions are plotted in Figure 3. 
The result for the QDOT + 1.2 Jy survey is from Hamilton (1995), who 
gives additional details of the analysis. The Stromlo-APM result, new to 
this review, follows essentially the same analysis. 

On linear scales (to the right of the graphs in Figure 7) the quadrupole- 
to-monopole ratio is expected to go over to a constant. While there is some 
indication, more particularly in the IRAS surveys, that the ratio is indeed 
asymptoting to a constant, it is evident that nonlinearities have some ef- 
fect even at scales as large as a half-wavclcngth^*^ of Ti/k ~ 40/i~"^Mpc. 
One should probably not believe the quadrupole-to-monopole ratio for 
Tr/k ^ 50^~^Mpc, where the uncertainties are becoming large. On small 
scales the nonlinear fingcr-of-god effect lowers the quadrupole-to-monopole 
ratio, so that the quadrupole power is negative below a half-wavelength of 
Tr/k « 5/i-^Mpc in the QDOT + 1.2 Jy survey, and below ir/k 8h ^Mpc 
in the Stromlo-APM survey. 

For the QDOT -|- 1.2 Jy survey, nonlinear effects appear to be ade- 
quately described by a model in which the linear distortion is modulated 
by a random exponentially distributed pairwise velocity dispersion (Fisher 
et al. 1994b) (see §7.2). A least squares fit to the quadrupole-to-monopole 
ratio over half-wavelengths 5-44/i~"^Mpc gives a one-dimensional pairwise 
velocity dispersion of 322kms~^. Although this method of measuring the 

^"The uncertainty principle ensures that the correspondence between wavenumber and 
separation is inevitably uncertain. Empirically however, I find that a half-wavelength -K/k 
typically gives a better estimate of the physical scale of separation than for example a 
full wavelength 27r/fe or an inverse wavenumber 1/k. 
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Figure 7. Ratio P2ik)/F'Q{k) of smoothed quadrupole to smoothed monopolc power 
as a function of effective wavenumber k in: (left) the merged IRAS QDOT plus 1.2 Jy 
survey; and (right) the Stromlo-APM survey. These are the same surveys whose red- 
shift correlation functions axe illustrated in Figure 3. For the QDOT -|- 1.2 Jy sample 
in the left panel, the dashed line is a fit to a model in which the linear redshift dis- 
tortion is modulated by a random exponentially distributed pairwise velocity dispersion 
(§7.2). For the Stromlo-APM survey in the right panel, the upper thick line shows the 
quadrupole-to-monopolc ratio which results when fingcrs-of-god are compressed, and the 
dashed line is a straight line fit to this line. The scales on the right show the value of 
/3 corresponding to the linear ratio Pi/Po^ equation (5.12), and the inferred value of 
Slo, equation (1.5), in the_ absence of bias, b = 1. Beware that the smoothing window 
VF(fc) oc A;^ exp[— (1.279fc/fc)^], equation (5.15), is broad, so the points are correlated; 
roughly every third error bar is uncorrelated. The horizontal and slanted bars on the 
(Icr) errors come from two separate ways of measuring uncertainties. 

velocity dispersion is crude compared to Fisher et aZ.'s (1994b) procedure, 
the number is nonetheless in good agreement with Fisher et aUs scale- 
dependent estimate (§8.2.2). As reported by Hamilton (1995), the best fit- 
ting value of the linear redshift distortion parameter in the QDOT -|- 1.2 Jy 
survey is (3 = OmtoH (Icr). 

For the Stromlo-APM survey, the nonlinear effects appear large enough 
that simple model-fitting to the quadrupole-to-monopole ratio seems a haz- 
ardous procedure. Certainly there would be a large covariance between P 
and the velocity dispersion, so that neither would be well measured. One 
alternative procedure is to deal with nonlinearities at a more fundamental 
level, by identifying individual fingers-of-god in the survey, and compress- 
ing them (Gramann, Cen & Gott 1994). Fingers can be identified using a 
friends-of-friends algorithm with a window elongated in the radial direction 
(e.g. Ramella, Geller &; Huchra 1989). Figure 7 shows the quadrupole-to- 
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monopole ratio in the Stromlo-APM survey that results from compressing 
fingers-of-god whose overdensities exceeded 10 when measured through el- 
hptical windows 10 times longer in the radial than transverse direction 
(so the overdensity of the compressed fingers exceeded 10 x 10 = 100; to 
avoid spurious fingers in distant, sparsely sampled regions of the survey, 
the transverse link length was also limited to less than 5/i~^Mpc). The 
result is encouraging in two ways: first, that the quadrupolc-to-monopole 
ratio is fiattened almost to a constant on all scales (at least until the un- 
certainties go out of control at tt/^ <i 50/i~^Mpc); and second, that the 
ratio is hardly changed at half-wavclcngths ix/kl^ 40/?,^^Mpc, suggesting 
that the ratio is close to linear at these scales. I choose to quote the value 
P = 0.30^0^1 (lo-) measured at a half-wavelength of ir/k = 20/i~^Mpc. 
Even though the statistical uncertainty is smaller at shorter wavelengths, 
the systematic uncertainty associated with compressing the fingers-of-god 
presumably becomes larger. 

That j3 is smaller for optical than IRAS galaxies is consistent with 
the standard interpretation in which /3 Ri Q,Q-^/b, and optical galaxies are 
positively biased relative to IRAS galaxies. The values of /? measured here 
indicate bopticai /biRAS ~ 2.3 with an uncertainty of a factor of 2 or 3. 

6.2. ANALYSIS 

The smoothing window used in Figure 7 is the power law times Gaussian 
window W{k) oc exp[— (1.279fc/fc)^] of equation (5.15), with index n = 2, 
the smallest n that allows the quadrupole to be measured. This choice of n 
yields the broadest smoothing window of the form (5.15), so the errors in 
the ratio PI (^)/-Po (^) of smoothed power spectrum harmonics are smallest 
for this n, although the results at different k are also then most correlated. 

A feature of this analysis is that, although it is the power spectrum 
that is being measured, all the calculations are done in real (redshift) space 
rather than in Fourier space. In measuring redshift distortions, it is im- 
portant to disentangle the true distortion from the artificial distortion in- 
troduced by a non-uniform survey window. In real (redshift) space, the 
observed galaxy density is the product of the true density and the selection 
function. In Fourier (redshift) space, this product becomes a convolution. 
Thus the natural place to 'deconvolve' observations from the selection func- 
tion is real space, where deconvolution reduces to division, and where the 
observations exist in the first place. The deconvolution procedure is 'exact', 
stable in practice, and admits a near minimum variance pair weighting. 

Specifically, the smoothed harmonics -P/(fe) of the redshift power spec- 
trum are computed, in accordance with equations (5.13) and (5.9), by tak- 
ing a suitably weighted sum over galaxy pairs of Wi{r), equation (5.16) for 
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n = 2, times Ve{iJ,r), the fth Legendre polynomial: 

Pi = {21 + 1) J Wiir)VeMe(.r,tir)d^r (6.1) 

in which the redshift correlation function ^''(r, at separation r and 
cosine angle nr = z.r to the line of sight z is estimated by Hamilton's 
(1993b) estimator 

, {DD)(RR) , 
nr,M = ^^^-l (6.2) 

where, following the conventional notation of the literature, D signifies data, 
and R signifies random background points, (although in practice all the 
background integrals here were done as integrals, not as Monte-Carlo inte- 
grals) , and the angle brackets ( ) represent near-minimum- variance weighted 
(Hamilton 1993b, eqs. [60] & [61]) averages over pairs at separation r and 
fir- The line of sight z is defined separately for each pair as the angular 
bisector of the pair, and to ensure the validity of the plane-parallel approxi- 
mation, only pairs closer than 50° on the sky are retained. Poisson sampling 
noise is removed by excluding self-pairs (pairs consisting of a galaxy and 
itself). 

Uncertainties in the ratio P2{k)/PQ{k) plotted in Figure 7 were esti- 
mated by two distinct methods, distinguished in Figure 7 by horizontal 
and tilted bars on the errors. Both methods take account of the correlated 
character of the galaxy distribution. 

The first method (horizontal bars) is described and justified in detail by 
Hamilton (1993b, §4). The procedure is to subdivide the survey into a few 
hundred volume elements, measure the fluctuation in the power attributable 
to each volume element, and form a variance by adding up the covariances of 
the fluctuations between the volume elements. However, there is an integral 
constraint that implies that the sum of all covariances between all volume 
elements is exactly zero, so it is necessary to truncate the sum at some point. 
The procedure adopted is to order the covariances in order of increasing 
distance between each pair of volume elements, and to take the variance 
to be the maximum value of the cumulative covariancc. The advantage of 
this method is that it makes minimal assumptions about the origin and 
nature of the errors. The main defect of the method is that it necessarily 
underestimates the variance on scales approaching the scale of the survey. 

The second method (tilted bars), inspired by Fcldman, Kaiser & Peacock 
(1994), is to take the Poisson (i.e. self-pair) variance of pairs ij weighted by 
[1 -I- n*(ri)PQ (^)][1 -I- n^{rj)PQ(k)]. Properly, this method is valid only for 
Gaussian fluctuations, and at wavelengths that are less than some fraction 
of the scale of the survey; moreover this estimate fails to take into account 
the fact that the covariance of the redshift power spectrum (fc) is different 



60 



A. J. S. HAMILTON 



in different directions of the wavevector k. However, tlie estimate is a useful 
check on the rehabihty of the uncertainties computed by the first method. 
The method may be expected to overestimate the variance somewhat on 
scales approaching the scale of the survey, and Figure 7 shows that it clearly 
underestimates the uncertainties in the nonlinear regime, where fluctuations 
are non-Gaussian. 

7. Translinear Regime 

Figure 7 indicates that nonlinearities affect the redshift power spectrum 
even at half- wavelengths as large asn/k « 40/i~^Mpc (although one should 
bear in mind that the influence of nonlinearity may be exaggerated in Fig. 7 
because of the broad smoothing window) (sec also Bromley, Warren & 
Zurek 1997). Studies of the ratio of redshift to real space power spectra 
(Fisher et al. 1993; Brainerd et al. 1996; see also: Suto Sz Suginohara 1991; 
Bahcall, Cen & Gramann 1993; Gramann, Cen 8z Bahcall 1993; Brainerd 
&: Villumsen 1993, 1994) lead to the same conclusion, that nonlinearity can 
affect the redshift power spectrum even at quite large scales. Moreover, 
there is good reason to push to smaller scales, to improve the statistics. 
Thus a good understanding of nonlinearities would appear essential to a 
reliable measurement of the linear distortion parameter (3. With such an 
understanding, it might even become possible to break the degeneracy in 
P ^ J7q^/6 between the cosmological density and bias b. 

7.1. LINEAR OR NONLINEAR? 

Before going any further, it is worth emphasizing that redshift distortions 
are more linear than they look. 

This can be seen from Figure 2 back at the beginning of this review. 
For example, a region that is just turning around in real space, a mildly 
nonlinear condition, appears to be collapsed to a sheet in redshift space. 
The edge of the sheet forms a caustic, a line of infinite density, a thoroughly 
nonlinear condition (note that only the boundary of the collapsed sheet 
appears at infinite density; the complete, infinite density, caustic surface of 
a collapsing overdensity is the envelope of the finger-of-god in Figure 1). 

This suggests that theories that go a step beyond the linear regime, for 
example the Zcl'dovich approximation (§7.3), or perturbation theory (e.g. 
Scoccimarro & Frieman 1996; Hivon et al. 1995), may prove quite successful 
in modelling redshift distortions in the translinear regime. 
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7.2. RANDOM ISOTROPIC PAIRWISE VELOCITY DISPERSION 

The most common approach to nonhnearity has been phenomenological 
rather than based on any rigorous theory. The approach has been to sup- 
pose that the nonhnear redshift correlation function is the result of con- 
volving the linear redshift correlation function with the line-of-sight 
component of a random isotropic pairwise velocity distribution f{v). In the 
plane-parallel approximation, which should normally be adequate for small, 
nonlinear scales, this model gives (with f{v) normalized to f{v)dv = 1) 

/■CO 
CL{r//-v,r±)f{v)dv . (7.1) 
-oo 

Note that this model does not require that the individual galaxies of a pair 
should separately have random isotropic velocity distributions, although 
some authors have found it convenient also to assume the latter (e.g. Pea- 
cock &; Dodds 1994, to allow them to consider cross-correlations between 
different populations; or Heavens & Taylor 1995, to allow them to deal 
with radial rather than plane-parallel redshift distortions). Convolution in 
real space becomes multiplication in Fourier space, so the redshift power 
spectrum in this model is the product of the linear redshift power spec- 
trum P'l{k) = (1 + j3fi'^)'^P{k) with the linc-of-sight Fourier transform 

/(^//) = S'^oof{v)e^^//'"dv (note k// = kfij^) of the velocity distribution: 

P^(fc) = Kkfik)Pl{k) = />A^fe)(l + • (7-2) 

Those authors who have invoked this model have generally adopted 

either a Gaussian velocity distribution (Peacock & Dodds 1994; Heavens 
& Taylor 1995; Tadros & Efstathiou 1996), for which f{v) = [{27r)^/^a]-^ 
exp[-(w/cr)^/2] and 

/>Mfc) = exp[-(aA:/Xfc)2/2] ; (7.3) 

or else an exponential pairwise velocity distribution (Fisher et al. 1994b; 
Hamilton 1995; Ballinger, Peacock & Heavens 1996; Peacock 1997; Brom- 
ley, Warren & Zurek 1997; Ratchffe et al. 1997; Cole, Fisher k Weinberg 
1995 use an exponential single-particle distribution, equivalent to a pair- 
wise distribution that is the convolution of two exponentials), for which 
f{v) = (2VV)-iexp(-2V2|^;|/a) and 

The velocity dispersions a above are, after the convention established by 
Peebles (1976, 1980 eq. [76.14]) and Davis &; Peebles (1983), 1-dimensional 
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pairwise velocity dispersions, which are 2^/^ larger than a random 1-dim- 
ensional single-point (galaxy) dispersion, and 3^/^ smaller than a 3-dimen- 
sional pairwise dispersion. 

The exponential pairwise velocity distribution was first proposed as a fit 
to observations by Peebles (1976), and has held up as a good approximation 
both in observations (Davis & Peebles 1983, CfAl; Fisher et al. 1994b, 
1.2 Jy survey; Marzkc et al 1995, CfA2 + SSRS2; Lin 1995, LCRS), and 
in iV-body experiments (Fisher et al. 1994b, Fig. 5; Zurek et al. 1994, 
Fig. 7), although the latter reveal appreciable skewness in the distribution 
of pairwise infall velocities. 

7.3. ZEL'DOVICH APPROXIMATION 

The Zel'dovich approximation (e.g. Hui & Bertschinger 1996) is essentially 
linear theory expressed in Lagrangian space. In linear theory, particles 
(galaxies) move in straight lines from their initial comoving positions r^, 
with comoving displacements Ar = D{t)ArL growing in proportion to the 
linear growth factor D{t) 

r{t) = ri + Dit)ArL{ri) (7.5) 

where Ari = — VV~^(^l is the linear displacement field, with 5l the linear 
overdensity, evaluated at some suitably early epoch. The Zel'dovich approx- 
imation supposes that the linear result (7.5) remains true also at nonlinear 
epochs. By construction, the Zel'dovich approximation satisfies the conti- 
nuity equation, but it fails to satisfy the Euler equation beyond the linear 
regime. 

Redshift distortions in the Zel'dovich approximation have been studied 
by Fisher k Nusscr (1996), Taylor & Hamilton (1996), and Hatton & Cole 
(1997). All three sets of authors tested the predictions of the Zel'dovich ap- 
proximations against iV-body simulations, and all three adopted the plane- 
parallel approximation and chose the ratio of quadrupole to monopole har- 
monics of the redshift power spectrum as the test statistic. Fisher & Nusser 
and Hatton & Cole examined iV-body simulations with CDM-like power 
spectra, while Taylor & Hamilton examined simulations with power law 
power spectra, P{k) oc /c" with n = —1, —1.5, and —2. 

The conclusions can be summarized as follows. The quadrupole-to- 
monopole ratio P2{k)/PQ{k) as a function of k can be characterized by 
a shape, an overall amplitude, and an overall scale A;o that is conveniently 
taken to be the zero-crossing of the quadrupole power, PI (^o) = 0. Then: 

• The shape of the quadrupole-to-monopole ratio depends on the shape 
of the power spectrum, but is insensitive to the redshift distortion pa- 
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rameter /3. This shape is accurately predicted by the Zel'dovich approx- 
imation at least down to the zero-crossing of the quadrupole, k ^ ko. 

• The overall amplitude of the quadrupole-to-monopolc ratio depends 
on the linear rcdshift distortion parameter /? according to the usual 
linear formula (5.8). 

• The zero-crossing scale /cq of the quadrupole is not reliably predicted 
by the Zel'dovich approximation. 

Actually, Fisher & Nusser argued that the shape of the quadrupole-to- 
monopole ratio is a universal function that is insensitive to the shape of 
the power spectrum. Within the range of CDM-like power spectra this is 
probably a reasonable approximation. However, the investigations of Taylor 
& Hamilton showed that if a broader range of power spectra is considered, 
then there is a dependence on the shape of the power spectrum which, 
according to the Zel'dovich approximation, becomes marked as the spectral 
index n —3. There is general agreement that the shape of the quadrupole- 
to-monopole ratio is insensitive to P, the latter serving simply to set the 
overall normalization of the ratio. 

The third conclusion, that the Zel'dovich approximation does not pre- 
dict the zero-crossing of the quadrupole power reliably, is based mainly on 
the results of Taylor &; Hamilton, but the conclusion seems to be consis- 
tent with the results graphed by Hatton Sz Cole (Fisher & Nusser offer no 
information on this point). It is not entirely clear what is responsible for 
the discrepancy in the zero-crossing. 

Fisher &; Nusser and Taylor & Hamilton both interpreted the agree- 
ment between the predicted Zel'dovich and measured A''-body shape of the 
quadrupole-to-monopole ratio as suggesting that the departure of this ratio 
from a constant value in the translinear regime is caused not by random 
velocities in nonlinear virialized fingers-of-god, but rather by coherent in- 
fall towards collapsing clusters. This is encouraging if true, because it lends 
hope that redshift distortions can be modelled accurately with more rigor- 
ous theoretical treatments in the translinear regime. 

7.4. FUTURE WORK 

The shortcomings of modelling nonlinearity in redshift distortions either 
with random pairwise velocities, or with the Zel'dovich approximation, have 
been emphasized by Hatton & Cole (1997). It is clear that a more thorough 
and rigorous treatment of redshift distortions in the translinear regime, for 
example using perturbation theory (e.g. Scoccimarro &: Prieman 1996), is 
needed. 

As usual, bias complicates the problem. One clear bias is that highly 
nonlinear fingers-of-god are more prominent in optical and elliptical pop- 
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ulations than in IRAS and spiral populations (Fig. 3; Guzzo et al. 1997). 
Long fingers in rich clusters appear stretched to large separations in red- 
shift space, contaminating the linear and translinear regime. Perhaps this 
problem can be side-stepped by identifying and compressing prominent 
fingers (§6.1; Gramann, Cen k, Gott 1994). Or perhaps estimators can be 
constructed that arc somehow orthogonal^^ to the nonlinear fingers, as sug- 
gested by Bromley, Warren &; Zurek (1997; these authors claimed to do just 
this, but actually their estimator, based on eq. [8.24] in §8.2.3 below, merely 
corrects for nonlinearity, and is not orthogonal to it). 

8. Measurements of (3 from Linear Redshift Distortions 
8.1. COMPILATION 

Table 8.1 lists measurements of the distortion parameter (3 from redshift 
distortions in the linear regime. The table is intended to be complete up to 
the first half of 1997. Compare this compilation to that of Strauss &; Willick 
(1995, Table 3, p. 412). Note that Table 8.1 does not include measure- 
ments of (3 from direct measurements of peculiar velocities, which Strauss 
k, Willick do include. 

The Table is subdivided into the three principal methods described §5. 
However, precise procedures used by different authors differ in many re- 
spects, and the threefold categorization is not as clean as the headings 
might suggest. Comments on the individual measurements are given in the 
next subsection, §8.2. 

The average and standard deviation of values listed in Table 8.1 is 
/^optical = 0.52 lb 0.26 for optical galaxies, and (3iras = 0.77 ± 0.22 for 
IRAS selected galaxies (the quoted uncertainty is the deviation of a sin- 
gle measurement; the lower limit from Taylor k. Hamilton was omitted). 
In averaging the numbers, I felt it was better to assign equal weight to 
all measurements, rather than to rely either on the authors' quoted un- 
certainties or on my own judgement of their quality. Of course the data 
sets — especially the IRAS data — arc not all independent of each other, 
so the standard deviation rcfiects differences in methods as much as any- 
thing else. All the same, the dispersion of the measurements turns out to 
be about the same as the uncertainties that people quote, so is perhaps 
not unreasonable. If the difference in optical and IRAS /3's is attributed 
to bias, it implies a relative bias of &opticai/^ffiA5 ~ 1-5, consistent with 
the ratio feopticai/^ffivis ~ 1-3 measured from the square root of the ratio 
of their power spectra (Peacock k Dodds 1994; Peacock 1997, Fig. 2). If 

^^A nice example of what is meant here by orthogonality is the case of the angular 
correlation function or angular power spectrum, which is orthogonal to — i.e. completely 
unaffected by — redshift distortions. 
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Survey (5 Reference 



1. Ratio of redshift space to real space power spectra (plane-parallel) 



CfA 


0.53 ±0.15 


Pry & Gaztanaga (1994) 


SSRS 


1.10 ±0.16 




IRAS 


0.84 ± 0.45 




IRAS 


1.0 ±0.2 


Peacock & Dodds (1994) 


Optical 


0.77 ±0.15 




Stromlo-APM 




Baugh (1996) 


Stromlo-APM 


0.48 ±0.12 


Loveday et al. (1996a) 


Stromlo-APM 


0.20 ± 0.44 


Tadros & Efstathiou (1996) 


Optical 


0.40 ±0.12 


Peacock (1997) 


Durham/UKST 


0.52 ± 0.39 


Ratcliffe et al. (1997) 



2. Ratio of quadrupole to monopole redshift power (plane-parallel) 



IRAS 2 Jy 




U.D»_Q 24 


Hamilton (1993a) 


Perseus-Pisces 


0.75 ±0.3 


Bromley (1994) 


IRAS 1.2 Jy 


0.451°:?^ 


Fisher et al. (1994b) 


IRAS 1.2 Jy 


0.52 ±0.15 


Cole, Fisher & Weinberg (1995) 


QDOT 


0.54 ±0.3 




LCRS 


0.5 ±0.25 


Lin (1995) 


QDOT ± IRAS 1.2 Jy 


0.69«;- 


Hamilton (1995) 


IRAS 1.2 Jy 


0.6 ±0.2 


Fisher & Nusscr (1996) 


QDOT ± IRAS 1.2 Jy 


> 0.5 (95% confidence) 


Taylor & Hamilton (1996) 


IRAS 1.2 Jy 




Bromley et al. (1997) 


Durham/UKST 


0.48 ±0.11 


Ratcliffe et al. (1997) 


Stromlo-APM 


o-3ol°o:lI 


This paper, §6 



3. Maximum Likelihood (radial) 



IRAS 1.2 Jy 0.961^1^ Fisher, Scharf & Lahav (1994) 

IRAS 1.2 Jy 1.1 ± 0.3 Heavens & Taylor (1995) 

IRAS 1.2 Jy 1.04 ± 0.3 Ballinger, Heavens & Taylor (1995) 



optical galaxies are unbiased, ^optical = !> then the inferred value of the 
cosmological density is Q,q = O.SSIq ^ • If on the other hand IRAS galaxies 
are unbiased, hiRAS = 1) then fio = 0.63lo;27 • 
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8.2. COMMENTS ON INDIVIDUAL MEASUREMENTS 

The comments in this subsection are arranged in approximately the chrono- 
logical order in which the measurements were published, except that mea- 
surements by the same author or group of authors are kept together. 

8.2.1. Hamilton et al. 

Hamilton (1993a) measured /3 in the IRAS 2 Jy survey (Strauss et al. 
1992a) from a ratio of quadrupole-to-monopole correlation functions, weigh- 
ted with pair separation in the fashion proposed by Hamilton (1992): 

where Coir) = ^r'^ lo ^ois)s'^ds. Cole, Fisher & Weinberg (1994, eq. [2.17] 
and Appendix B) subsequently pointed out that Hamilton's ratio (8.1) was 
actually just a ratio of smoothed quadrupolc to smoothed monopole power, 
equation (5.12), in which the smoothing function is taken to be the second 
spherical Bessel function, W{k) = jiikr) in equation (5.11). This smoothing 
window vanishes at zero wavenumbcr, W{k) = at A; = 0, immunizing the 
estimate against uncertainty in the mean density. 

Hamilton (1995) and the present paper, §6 and Figure 7, use essentially 
the same procedure as Hamilton (1993a), but with the smoothing function 
taken to be a power law times a Gaussian, equation (5.15) with n = 2. 

A principal topic of Hamilton (1995) was the presentation of evidence 
of a large, by the look of it systematic, difference in clustering between the 
near and far regions of the QDOT sample, which Hamilton suggested might 
arise from some systematic in the IRAS PSC fluxes. To date no satisfactory 
explanation of this problem has emerged. 

Taylor & Hamilton (1996) studied the ZclVlovich approximation as a 
means of approximating redshift distortions in the mildly nonlinear regime 
(see §7.3). As an application, they fitted the ratio of quadrupole-to-monopole 
power measured by Hamilton (1995) from the IRAS QDOT -|- 1.2 Jy sur- 
vey, the same data as arc plotted in Figure 7. They concluded that (3 > 0.5 
at the 95% confidence level, using a full covariance matrix estimated from 
the data. 

8.2.2. Fisher et al. 

The paper by Fisher et al. (1994b) was published after that of Fisher, Scharf 
& Lahav (1994), but the former seems to be the intellectual predecessor, 
so here it appears first. 

Fisher et al. 's (1994b) paper on the IRAS 1.2 Jy redshift survey is es- 
pecially useful because of its careful study of nonlinear effects through Cold 
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Dark Matter (CDM) A^-body simulations engineered to the statistical prop- 
erties of the 1.2 Jy survey. They pointed out that the redshift correlation 
function along the line of sight of sight axis, ^*(r//,0) with r± = 0, gives a 
clear signature of the functional form of the pairwise velocity distribution, 
and find that an isotropic exponential pairwise velocity distribution fits the 
IRAS data well — indeed better than a more detailed anisotropic model 
that they tried. 

Fisher et al. measured the linear distortion parameter /3 from the statis- 
tic 

dln^r) _ 3vMr)[l+ar)] 
^ aina2 2r^(r) ^ ' 

where Vi2{f) is the mean pairwise infall velocity at separation r, and ^(r) = 
3r~^ Jq $,{s)s'^ds. The correlation functions ^(r) and ^(r) in equation (8.2) 
are in real space, not redshift space, and Fisher et al. adopted the power law 
fit ^(r) = (r/3.76 /i~^Mpc)^'^^ to the IRAS real space correlation function 
estimated by Fisher et al. (1994a) from the angular correlation function. 
The last equality in (8.2) is just the pair conservation equation (Peebles 
1980, eq. [71.7]), true in both linear and nonlinear regimes, while the first 
equality in (8.2) is true in the linear regime. Fisher et al. showed that the 
statistic (8.2) gave a reliable measure of /? at separations r ^ 10/i~^Mpc in 
three CDM simulations, one flat and unbiased, one flat and biased (6 = 1.6), 
and one open and unbiased. 

To determine the infall velocity Vi2{r) in the estimator (8.2), they mod- 
elled the pairwise velocity distribution as an isotropic exponential with 
mean infall velocity vi2{r) and dispersion cr(r), the shape (but not ampli- 
tude) of these two velocities as a function of separation r being determined 
in essence by the CDM simulations, but scaled and adjusted to match the 
power law fit to the IRAS real space correlation function. To determine 
the overall amplitudes of the infall velocity vi2{r) and dispersion cr{r), they 
performed least squares fits to two functions that they measured from the 
data at separations up to 16/i~^Mpc: the quadrupole-to-monopole ratio 
of correlation functions ^2('')/^o('')! ^^'^ line-of-sight correlation func- 
tion '^*(r//,r^) with r_L < 2/i~^Mpc. In carrying out the fits, they used 
a full covariance matrix estimated from the simulations. The measured 
pairwise velocity dispersion (with scale-dependence fixed by their proce- 
dure) was cr(r) = 317^49 km s^^ at a separation of r = l/i~^Mpc, rising 
to ~ 400kms~^ at r = 4/i^^Mpc, then declining again to ~ 350ft.^^Mpc 
at r = 10/i~^Mpc (Fisher et al., Fig. 3). The final result from the estima- 
tor (8.2) was P = 0.45^^:?^. 

Fisher, Scharf & Lahav (1994) were the first to study radial redshift 
distortions without using the plane-parallel approximation, and the first 
to apply a maximum likelihood (ML) formalism to measure /? (albeit with 
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a rather crude choice of radial density modes). They expanded the IRAS 
1.2 Jy density field in spherical harmonics about the observer (us), window- 
ing the density in the radial direction with Gaussian windows centred at 
four depths, 38, 58, 78, and 98/i~^Mpc, each with a dispersion of 8/j~^Mpc. 
To make spherical harmonics uncorrelated requires full sky coverage, and 
the 1.2 Jy survey has the asset that it already covers almost all the sky 
at galactic latitudes |6| > 5°. While it is possible to take into account the 
coupling of harmonics induced by incomplete sky coverage. Fisher et al. 
chose instead to complete the sky coverage of the 1.2 Jy survey by interpo- 
lating over the plane of the Milky Way in a way that smoothly continued 
structure. They worked in the Local Group (LG) frame (§4.3.1), and the 
equations below do likewise. 

In the course of their analysis. Fisher et al. did a cute trick, which is 
worth pointing out here. Let a be the rcdshift space galaxy density n^{s) 
in the LG frame, smoothed over some window /(s) (superscripts LG's on 
s^'^ and n'^^ are omitted for brevity): 



The second equality in equation (8.3) is true because galaxies are conserved, 
equation (4.25), and the last approximation, just a Taylor expansion of 
/(s), is valid for windows f{s) that are sufficiently smooth in the radial 
direction. Fisher et al. emphasized the point that the selection function 
n{r) in a flux-limited survey is in fact a function of the true distance r 
to a galaxy, not of its redshift distance s (see footnote^^). Equation (8.3), 
which is essentially an integration by parts, recasts the effect of redshift 
distortions into the window f{r) rather than the selection function and the 
density, which seems neat. 

Actually, the redshift distortion is still there in the last expression of 
equation (8.3), lurking in the peculiar velocity v — r.v^'^ . In the linear 
regime, the velocities v and v^^ are given as usual by equations (4.31) and 
(4.43), and equation (8.3) reduces to 



It remains necessary nonetheless to estimate the real space selection function some- 
how, in order to evaluate the quantities (8.7) and (8.8). In practice Fisher et al. used the 
selection function n°{r) estimated in rcdshift space, and discarded all monopole (t = 0) 
modes. This is fine for an all-sky survey — see the final paragraph of §4.4.3. 



a 




(8.3) 




(8.4) 
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where it has been assumed that / f{r)n{r)d?r = 0. The latter is true for all 
the windows considered by Fisher et al, and is a desirable feature in any 
case to immunize measurements against uncertainty in the mean density. 

Fisher et al. took the spherical transform of the redshift space density 
folded through Gaussian radial windows, which corresponds to choosing 
windows femif) in equation (8.3) (the subscript im is appended to distin- 
guish the windows) of the form 



femir) = f{r)Yemir) 



(8.5) 



with /(r) oc e~[^^~''°^/'^]^/^. They showed from equation (8.4) that, for an all- 
sky survey, the resulting spherical transform aim could be written (Fisher 
et al. eq. [8]) 



wi{k) + pwf{k) 6im{k) edk/{27r) 



.6) 



where Sim{k) is the spherical transform (3.7) of the unredshifted overden- 
sity, we{k) is 

/■oo 

m(,k) = i^Air / jt{kr)f{r)n{r) r^dr , (8.7) 
and w^{k) is a correction term embodying the redshift distortions: 



rc 

wf{k) = i^in 

Jo 



j'iikr) - Isf, 



dfir] 
kdr 



n{r) r^dr 



(8.8) 



with j[{x) = dji{x)/dx the derivative of the £'th spherical Bessel function. 
The Kronecker delta term \5f^, which subtracts from the dipole (£ = 1) 
term jj(fcr) its value at r = 0, is the term that arises from the motion of 
the LG. 

It is important in equation (8.6) that the survey covers the entire sky. 
If the survey were not all-sky, then the functions W(,{k) in equation (8.6) 
would be replaced by matrices w^^J" {k), which act by matrix multiplication 
on the spherical transform di'm' (k) of the overdensity. Only in the case of 
an all-sky survey does the matrix w^j^ {k) become diagonal; the quantities 
we{k) in equation (8.6) are then the eigenvalues of the matrix. 

For an all-sky survey, statistical isotropy implies that the are un- 
correlated, with variances from equation (8.6) 



ilaemf) = / Mk) + Pwfik) P{k) k^dk/{2'Kf + Nt 



(8.9) 



where N^jn is the Poisson sampling noise, which is just the (expected) self- 
galaxy contribution to ([a^^p). Fisher et al. used four separate Gaussian 
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radial windows. Here there are also covariances (a^^a^^) between the differ- 
ent radial windows i and j, but still only covariances with the same £m are 
non-zero. To determine f3, Fisher et al. combined the observed amplitudes 
measured in the 1.2 Jy survey for ^ = 1 to 10 into a likelihood function, 
adopting as prior an unredshifted power spectrum P{k) with fixed shape 
corresponding to a CDM model with V = 0.2 (Efstathiou, Bond &; White 
1992), and fixed amplitude fjg = 0.69 it 0.04, as determined by Fisher et 
al. (1993). The final result was (3 = 0.96^o is ' which is the value quoted in 
Table 8.1. 

Fisher et al. also quote (in proof) a result reported by Fisher (1993), in 
which the likelihood analysis is done with the shape parameter T as well 
as (3 treated as a free parameter, with the result (3 = 0.94 it 0.17 and F = 
0.17 lb 0.05. The uncertainty here is the conditional la uncertainty, which 
comes from projecting the Aln£ = —0.5 contour on to the parameter axes. 
If the amplitude a% is also treated as a free parameter, then (5 = 0.47 ±0.25, 
r = 0.15±0.05, and erg = 0.81 it 0.06. There is a strong anti-correlation 
between /? and cts, and Fisher (1993) argues that the more reliable value of 
(5 is probably the higher one obtained when erg is set to the value measured 
from the real space correlation function. 

It is instructive to derive a more general version of equation (8.6). Let 
w{r) denote the product of a (complex- valued, in general) window f{r) 
(not necessarily of the form [8.5]) with the selection function n{r) (not 
necessarily all-sky) 

w*{r) = f{r)n{r) . (8.10) 

Assume, following Fisher et al., that J w{r)cfr = 0, always a good idea 
in any case to protect against uncertainty in the mean density. Then the 
windowed overdensity a, equation (8.3) is, for linear distortions, 

a = J w*{r)S'{r)d\ = j w*{r)S^^S{r) d\ = j [s^^^wir)]* S{r) d^r 

(8.11) 

where S^*^ is the linear redshift distortion operator in the LG frame, equa- 
tion (4.46), and S^^^ is its Hermitian conjugate, equation (4.50). It follows 
that the windowed density a is 

a = j[w{r) + pw'^{r)y6{r)d^r (8.12) 

where {r) is the result of the distortion term (the part proportional to 
f3) in S^*-'''', equation (4.50), acting on the window w{r): 



dr \dr 



a 



f d 



r^- — .— 

.2 Qj, 



r=o 



V ^a(r)r 



w{r) . 
(8.13) 
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The correction term (8.13) can be rewritten in a variety of ways, including 
one of the same ilk as Fisher et a/.'s expression, equation (8.8), 

The r/r^ term in the integrand, which subtracts from (r — r')/\r — r'p 
its value at r' = 0, is the term that arises from the motion of the LG. 
Recasting the integral (8.12) in spherical transform space, equations (3.7), 
(3.9), and (3.33), gives 

' r n * 

J2[wimik) + Pwg,{k)\ kmik) k'dk/ (27, f (8.15) 

which generalizes equation (8.6). Equation (8.13) reveals the nature of the 
correction term it;'^, and it is readily confirmed that Fisher et al.^s re- 
sult (8.8) is regained if the window is chosen to be w*{r) = /(r)l£m(f )n(r) 
in an all-sky survey, where the selection function n(r) is the same in all 
directions over the sky. 

The same derivation can be written compactly in terms of vectors and 
matrices in Hilbert space (see §3.3) 

a = wU' = w^S^^S = [S^^^wYS ={w + pw^)U . (8.16) 

Here w is a weighting function, a vector, S'"*^ is the linear distortion oper- 
ator (4.46) in the LG frame, a matrix, and S is the vector of (unredshifted) 
overdcnsities. Equation (8.16) generalizes easily to the case of an array of 
quantities: the scalar a in equation (8.16) simply becomes a vector a, and 
the weighting vector w becomes a matrix. The covariance matrix of such 
an array is 

(aa^) = {w + l3w^^)^^{w + pw^) + N (8.17) 

where ^ is the (unredshifted) power spectrum matrix, and N is the Poisson 
sampling noise matrix, the self-galaxy contribution to (aa^). 

8.2.3. Bromley et al. 

Bromley (1994) proposed the idea of measuring j3 from a ratio of variances 
of the redshift space density windowed through anisotropic sampling func- 
tions. Consider the statistic 5^ which is the value of the density filtered 
through some window (sampling function) w{r) randomly positioned in 
the redshift survey 

~5' = j w{r)S'{r)d\ = J w{r)^^J^d^r (8.18) 
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where the last equahty is true as long as the window is chosen to have 
vanishing volume integral, / w{r)d^r = 0, as was true for the windows 
considered by Bromley. The real space integral in (8.18) can be rewritten 
as a Fourier space integral 

5' = j w*{k)5%k)d^k / {277 f (8.19) 

where w{k) and 6^{k) are the Fourier transforms of the sampling function 
and overdensity. It follows that the expected shot-noise-subtracted variance 
of the filtered density is equal to the redshift space power spectrum 
P*(fe) folded with the power spectrum |t()(fc)p of the sampling function 

ps = ((^^)2^ - ^ = / d^k/{2TTf (8.20) 

where is the shot noise, the self-pair contribution to ((^*)^). Shifting 
the window w{r) to another random position in the survey simply changes 
the phase of w{k), which leaves the variance (8.20) unaltered. Bromley 
now proposed to choose windows w{r) that were the gradient along some 
unit direction n of some spherically symmetric function R{r), equivalent in 

Fourier space to in.k times its Fourier transform R{k) = J R{r)e^^ '''d^r: 

w{r) = n.VR{r) , w{k) = in.kR{k) . (8.21) 

If the unit directions n are chosen to be either parallel to or perpendicular 
to the line of sight to the window, then the ratio of parallel to perpendicular 
variances is (Bromley eq. [9]; see also Gramann, Cen & Gott 1994, eq.[10], 
which however contains an error) 

^ Jf,lW{k)P^{k)d^k ^ l + 

PI kli^-I^VW{k)Ps{k)d^k l + |/3+A^2 ) 

where the last equality follows from Kaiser's equation (4.36), and the smooth- 
ing window is W{k) = k'^\R{k)\'^. If the parallel and perpendicular variances 
are measured from many random samplings in a redshift survey, then their 
ratio P/'//Pj_ should yield a measure of /3. 

Bromley applied this idea to the Giovanelli & Haynes (1991, and ref- 
erences therein; Wcgner, Haynes &: Giovanelli 1993) redshift survey of the 
Pisces-Perseus region, which is complete to blue magnitude niB = 15.7 and 
contains over 4000 galaxies. He used a quartic window R = [1 — (r/A)^]^, 
with A = 14-17 /j-^Mpc. The value /? = 0.75 ± 0.3 listed in Table 8.1 is 
measured from Bromley's Figure 2 at A = 16/i~^Mpc, a length scale that 
he states gave the best results in GDM simulations of the procedure. 
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It is useful to relate Bromley's statistic (8.22) to the harmonics of the 
power spectrum. Both numerator and denominator are sums of monopole 
and quadrupole harmonics of the power spectrum filtered through the win- 
dow W{k) = k^\R{k)\^, equation (5.11), 

fjL = fS-^fX . (8.23) 

ps ps 1 ps ^ ' 

^± M) 5-^2 

Thus the procedure is effectively equivalent to measuring a quadrupole-to- 

monopole ratio of smoothed power spectra. 

Bromley, Warren & Zurek (1997) developed Bromley's (1994) procedure 
further, using a neat trick to correct for nonlinearity. The paper is notable 
also for reporting redshift distortions from two large (17 million particle) 
high resolution CDM simulations with 0, = 1 and Hq = 50km/s/Mpc, 
one 125/i^^Mpc on a side, the other 500/i~^Mpc on a side. The 1-dim- 
ensional pairwise velocity dispersion of the simulations was 1100 km s~^ 
and 850kms~^ (at a separation of 1 /i~^Mpc) respectively for the mass 
and the haloes ('galaxies'), considerably higher than observations. However, 
Bromley et al. were able to engineer a set of haloes with velocity dispersion 
384/i~^Mpc and clustering properties consistent with those of the IRAS 
1.2 Jy sample (Brainerd et al. 1996) by the device of eliminating all objects 
that were within 3 Abell radii of the highest density peaks and that had 
peculiar velocities greater than 750kms~^. Such culling is, as they say, 
'rather contrived', but it is true that IRAS galaxies are conspicuous by 
their absence from rich clusters such as the Coma cluster, and it is possible 
that nature pulls a similar trick. 

Bromley et al.'s idea for nonlinearity was to modify the anisotropy of 
sampling functions in a way designed to cancel nonlinearity. Suppose that 
WL{k) are a set of 'linear' sampling functions, and suppose that nonlinearity 
can be modelled with a random isotropic pairwise velocity dispersion f{v), 
as in §7.2. Then the sampling functions can be 'corrected' for nonlinearity 
by dividing them in Fourier space by the square root of the line-of-sight 
Fourier transform /(A://) (note k// = kjij^) of the velocity dispersion: 

w{k) = . (8.24) 

Bromley et al. assumed the exponential model for the pairwise velocity dis- 
tribution, equation (7.4), which appears to give a good description of pair- 
wise velocities in the 1.2 Jy survey (Fisher et al. 1994b). With this choice, 
and with the linear sampling functions chosen as before, equation (8.21), 
to be the parallel and perpendicular gradients of a spherically symmetric 
function, the expected shot-noise-subtracted variances (8.20) of the filtered 
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densities are equal to the redshift space power spectrum folded through the 
functions 

= l(l-4)(l + la2A;Vfc)fc'l^(fc)r (8.25) 

The factor (1 + ^cj^fc^/i^) 'corrects' the redshift power spectrum for non- 
linearity, equations (7.2) and (7.4), so the resulting ratio Py/P^ of parallel 
to perpendicular variances should be equal to the linear ratio (8.22). Once 
again, as in (8.23), it is instructive to express this ratio in terms of har- 
monics of the smoothed power spectrum: 

PS + f p| + aH^pr' + jpi^^ + ife^r^) 

ps ps _ i ps I „2( i psNL 1 1 psNL 2 psNL\ ^ ' ' 

where P^^^ are the harmonics of the power spectrum smoothed over the 
window k'^W{k) = k'^\R{k)\'^ instead of W{k) (cf. eq. [5.11]), 

/•oo 

Pf^= W{k)Piik)4nkUk/{2Trf . (8.27) 

^0 

The nonlinear correction is equivalent to replacing 'linear' estimators of 
the monopolc and quadrupole harmonics of the redshift power spectrum 
by 'nonlinear' estimators 

P^ - P^ + a'^lP^'^' + ^^Pi''') 

Pi - Pi + {^Pt' + ^^Pi""' + l^r^) ■ (8.28) 

Two comments can be made about the nonlinear correction. Firstly, it 
involves incorporating judicious quantities of monopole, quadrupole, and 
hexadecapole terms into the linear estimators of the parallel and perpen- 
dicular (or monopole and quadrupole) power, which seems like a simple 
and stable procedure. Secondly, the nonlinear correction terms involve the 
power spectrum multiplied by an extra factor of k'^, equation (8.27), so that 
the correction becomes larger further into the nonlinear regime, at larger 
k, as one might expect. 

Bromley et al. applied their procedure both to their TV-body simula- 
tions, and to the IRAS 1.2 Jy survey. For the spherical function R{r), 
they adopted a Gaussian R{r) = e~^^^'^^ in place of Bromley's (1994) 
quartic. As it happens, the resulting (linear) smoothing window W{k) = 
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k'^\R(k)\^ cx /e^e~(*^'^)^ coincides with that of Hamilton (1995), which is the 
same smoothing window used in Figure 7, and the nonUnear smoothing 
window k'^W{k) oc k^e~^^^^ also belongs to the class of windows specified 
in equation (5.15). 

The simulations gave good agreement with expectation. For the 1.2 Jy 
survey, Bromley et al. assumed a velocity dispersion of a = 320kms~^, 
which is a reasonable approximation to the (scalc-dcpcndcnt) velocity dis- 
persion measured by Fisher et al. (1994b). The final result was f3 = O.S^q's • 

8.2.4. Cole, Fisher et al. 

Cole, Fisher & Weinberg (1994, 1995) wrote a pair of papers, in the first of 
which they described and tested with AT-body simulations a procedure to 
measure the redshift power spectrum in the plane-parallel approximation, 
and in the second of which they applied their procedure to measure (5 from 
the quadrupole-to-monopole ratio of power in the IRAS 1.2 Jy and QDOT 
surveys. 

To ensure the validity of the plane-parallel approximation, they first 
windowed the surveys through sets of spherical bowler hat windows w{\r — 
rd) (a bowler is a top hat convolved with a Gaussian), centred at random 
positions Vc sufficiently far from the observer that the windows subtended 
an opening angle no greater than 50°. Cole et al. (1994, Fig. 8) showed 
that at this opening angle and using their procedure, the plane-parallel ap- 
proximation causes /? to be underestimated by approximately 5%. They 
defined the line of sight to each window as the direction to its centre 
Vc. In the plane-parallel approximation, the shot-noise-subtracted power 
spectrum P^{k,iij^) measured through a window w{\r — rd) (normalized 
to / w{r)d^r = 1) is the convolution of the true redshift power spectrum 
P*(/c, fif^) with the power spectrum 11^(^)1^ of the window (Cole et al. 1994, 
eq. [3.4]), 

P\k,n^) = j \w{\k-k'\)\^ P'{k',fif^,)(fk'/{2Trf . (8.29) 

It follows (Cole et al. 1994, eq. [3.5]) that the harmonics P^ik) of the power 
spectrum measured through the window are equal to the harmonics P^ik) 
of the actual power spectrum convolved with the harmonics Wi{k,k') of 
the power spectrum of the window: 

POO 

P!{k)= We{k,k')P^{k') Ank'^dk'/ {2Trf (8.30) 

^0 

where 

\w{\k' -kDf = Y,i'^^+^)'Pdk.k')Wi{k,k') . (8.31) 
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The spherical symmetry of the window ■w{r) is essential here to ensure that 
the £'th harmonic of the true power spectrum maps only to the £'th har- 
monic of the observed power spectrum. The fact that the kernels Wi^k, k') 
are different for different harmonics i means that the ratio PI (^)/-f(f (^) of 
measured quadrupole-to-monopole harmonics no longer satisfies the simple 
formula (5.8). However. Cole et al. (1994, Table 1 and Fig. 5) showed that 
the measured ratio (^)/^o (^) is equal to the true ratio P^ (A;)/Po (/c) mul- 
tiplied by a correction factor « 0.7 which, although appreciably different 
from unity, is nevertheless insensitive to the shape of the power spectrum. 
In practice they adopted correction factors appropriate for the F = 0.25 
CDM spectrum of Efstathiou, Bond & White (1992). 

The Fourier transform w{k) of the bowler hat window is the product of 
the first spherical Bessel function ji(fcPsph) with a Gaussian, where Pgph 
is the radius of the top hat. Cole et al. used only wavenumbers k for which 
kRsph are zeros of ji(A;i?sph) (in practice they used only the first two zeros, 
at kRsph = 4.49 and 7.72). With this choice, the integrand in equation (8.29) 
vanishes at fe' = 0, which eliminates leakage from power at zero wavcvec- 
tor, so immunizing the measurement of power against uncertainty in the 
mean density (which makes a delta-function contribution to power at zero 
wavevector). This clever trick was used first by Fisher et al. (1993). 

Cole et al. (1995) combined their measurements of the redshift power 
spectrum P^{k,Hf^) from bowler windows at different depths using an 
inverse- variance weighting, the variance of the power from each window be- 
ing estimated in the manner of Feldman, Kaiser & Peacock (1994). They fit- 
ted the averaged P^{k, fif^) to a sum of harmonics, from which they formed 
the quadrupole-to-monopole ratio PI (/c)/Po (A:), which they divided by the 
aforesaid correction factor 0.7 to obtain the true ratio P|(fc)/Po(fe). To 
allow for nonlinearity, they adopted the model in which the linear redshift 
distortion is modulated by a random one-point (not pairwise) exponential 
velocity dispersion. To arrive at a final value of /3 and the velocity disper- 
sion a in each of the 1.2 Jy and QDOT surveys, they carried out a least 
squares fit to the quadrupole-to-monopole ratios as a function of wavenum- 
ber k, using a full covariance matrix estimated from an ensemble of mock 
catalogues constructed from A^-body simulations. 

Fisher & Nusser (1996) studied the Zel'dovich approximation as a 
means of approximating redshift distortions in the mildly nonlinear regime 
(sec §7.3). As an application, they fitted the ratio of quadrupole-to-monopole 
power measured by Cole, Fisher & Weinberg (1995) from the IRAS 1.2 Jy 
survey, finding /? = 0.6 ± 0.2. 

8.2.5. Fry & Gaztanaga 

Fry & Gaztanaga (1994) compared redshift to real space correlation func- 
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tions in each of three redshift surveys, the first Center for Astrophysics 
(CfAl) survey (Huchra et al. 1983), the Southern Sky Redshift Survey 
(SSRS) (da Costa et al. 1991), and the IRAS 2 Jy survey (Strauss et 
al. 1992a). Their procedure was: measure the volume-averaged correlation 
functions ^ = V~'^ fy ^{ri2)cPrid^r2, in spherical volumes V for redshift 
space, and conical volumes V for real space; find the best power law fits to 
^^(r) and ^(r) that reproduce the behaviour of the volume-averaged ^ in 
each case; infer /3 from the ratio ^^(r)/^(r) of the fitted power law correla- 
tion functions at the largest separations probed, r ~ 5-10/i~^Mpc; repeat 
this for several volume-limited subsamples, and adopt an average p. The 
resulting values of P seem surprisingly large for such modest separations. 
However, given the indirectness of the procedure for measuring large scale 
power, and some lack of rigour in the error analysis, one might be inclined 
to take the numbers with a pinch of salt. 

8.2.6. Peacock et al. 

Peacock & Dodds' (1994) principal goal was not so much to measure /? 
as to reconstruct the linear power spectrum of mass fluctuations from a 
compilation of available observational evidence. A feature of the paper was 
the derivation of empirical analytic formulae relating linear and nonlinear 
power spectra, using a procedure inspired by one proposed by Hamilton et 
al. (1991). Updated versions of Peacock & Dodds' formulae are given by 
Peacock & Dodds (1996). 

The value of /? measured by Peacock & Dodds (1994) followed from 
a comparison of redshift space to real space power spectra. For the real 
space power spectrum they adopted the APM power spectrum of Baugh & 
Efstathiou (1993), while for redshift space power spectra they considered 
power spectra from the IRAS QDOT survey (Feldman, Kaiser & Peacock 
1994), the Stromlo-APM survey (Loveday et al. 1992), and the CfA2 sur- 
vey (Vogeley et al. 1992). To combine the various observed power spectra 
into a single canonical linear power spectrum, they introduced linear bias 
factors b, equation (4.15), corrected the galaxy power spectra for nonlinear 
evolution, and modelled nonlinearity in the redshift distortions with a ran- 
dom Gaussianly distributed velocity dispersion. To break the degeneracy 
between Qq and bias b that occurs in /3 « flQ^/b, they brought into con- 
sideration the power spectra of Abell clusters (Peacock & West 1992) and 
of radio galaxies (Peacock & Nicholson 1991), which they assumed to be 
biased but (unlike galaxies) unaffected by nonlinear evolution. With these 
assumptions they found ^'opticai/^7RA5 = 1-3 and Piras = 1-0 =t 0.2, which 
imply also /^optical = 0.77 ± 0.15. The absolute values of the bias factors 
were less well determined than their relative values; the best models had 
biRAS ~ 0.8. 
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Peacock (1997) carried out an improved version of Peacock & Dodds' 
(1994) analysis, arriving at the notably lower value of /^optical = 0.40 ±0.12 
from optical data alone. As in the earlier paper, for optical data Peacock 
used the real space APM power spectrum from Baugh &; Efstathiou (1993; 
also 1994), and redshift space power spectra from the Stromlo-APM survey 
(Loveday et al. 1992) and the CfA2 survey (Vogeley et al. 1992). The reason 
for the lower value of jS was that Peacock increased the real space APM 
power spectrum from Baugh & Efstathiou (1993) by a factor of 1.25 in 
order to match it to the amplitude of the real space correlation function 
of Stromlo-APM, as measured by Loveday et al. (1995) using the cross- 
correlation technique of Saunders et al. (1992). This adjustment reduces 
/^optical from 0.77 to 0.40, in accordance with equation (5.1). 

Peacock carried out a separate analysis for IRAS data, but here the 
results were less conclusive. He used a real space IRAS power spectrum 
obtained by transforming the cross-correlation function between the 1-in- 
6 QDOT survey and its parent QIGC catalogue, measured by Saunders, 
Rowan-Robinson &i Lawrence (1992), while for the redshift power spec- 
trum he adopted Tadros &: Efstathiou's (1995) power spectrum of the com- 
bined IRAS 1.2 Jy and QDOT surveys, in place of the Feldman et al. 
(1994) QDOT power spectrum. The redshift power spectrum of the com- 
bined IRAS 1.2 Jy and QDOT surveys is lower than that of the QDOT 
survey thanks in large part to a region in Hercules, which Tadros &; Ef- 
stathiou showed produces an anomalously large upward excursion in the 
QDOT power spectrum (cf. Hamilton 1995; Oliver et al. 1996). Tadros & 
Efstathiou's downwardly revised redshift power spectrum lay below Saun- 
ders et ffl/.'s real space power spectrum at wavcnumbers k < 0.05 /?. Mpc~^, 
nominally suggesting a negative (3, and causing Peacock to discard those 
points. Such machinations do not inspire confidence, and Peacock avoided 
drawing any definitive conclusion. 

Besides improved data. Peacock applied various theoretical improve- 
ments: he used an improved treatment of nonlinear evolution (Peacock & 
Dodds 1996); the ratio of optical and IRAS real space power spectra indi- 
cated some scale dependence in bias, which he fitted with a two-parameter 
power law model (Mann, Peacock & Heavens 1997) in place of the one- 
parameter linear bias model; he abandoned the use of Abcll cluster and ra- 
dio galaxy power spectra; and he used an exponential rather than Gaussian 
pairwise velocity distribution to model nonlinearity in redshift distortions. 

In the latter half of the paper, Peacock went on to discuss bias, Qq, 
and the shape of the reconstructed, linearized power spectrum, adducing a 
variety of arguments to conclude that a low bias, low Oq model is preferred 
over a high bias, high model. 
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8.2.7. Heavens, Taylor, et al. 

Heavens & Taylor (1995) were the first to apply a full-blown maximum 
likelihood procedure to measure /3, in a manner designed to retain as much 
information as possible about /3 in the linear regime. They took the IRAS 
1.2 Jy survey as the data set, conservatively cut to galactic latitude |6| > 
10°. They chose to work in a basis of spherical waves, and in the frame 
of the Local Group (LG). First, they defined the unrcdshiftcd overdensity 
modes 5nem by the discrete spherical transform (the definition below is the 
complex conjugate of Heavens &; Taylor's definition, but it conforms to the 
convention of the present review, eq. [3.8], which follows that of Peebles 
1980, §46) 

Snim = Cni / jl{kntr)Yim{r)5{r) (fr , 6{r) = ^ Cnijeikn£r)Ylj^{r)6n£m 
^r-<ri„ax 

(8.32) 

where the integration is over a finite sphere of radius Tmax = 200/i~^Mpc, 
the Cni are normalization constants, and the wavenumbers k^i are chosen 
to satisfy the boundary condition j'likniri-aax) = 0, which ensures that the 
predicted linear peculiar velocity field vanishes on the boundary of the 
sphere (see Fisher et al. 1995b for a discussion of boundary conditions). 
Next, they defined observed, redshifted modes Dnem by 

Dnlm^Cnij je{kner)Yini{r)w{r)n{r)5%r) d?r (8.33) 

which, besides being in redshift space, involve an additional factor w{r)n{r) 
in the integrand compared to (8.32). The radial weighting function w{r) 
(which in practice depended also on nt) is a near-minimum- variance weight- 
ing which helps to reduce the noise in each mode, hence to increase the in- 
formation content of each mode, and hence to reduce the number of modes 
required. They derived approximate expressions for the weightings wp{r) 
and Wji{r) that optimize each mode Dntm for measuring respectively the 
normalization of the power spectrum P and the distortion parameter /3. 
What it means to optimize modes for the measurement of particular pa- 
rameters is clarified by Tegmark, Taylor & Heavens (1997) and Tegmark 
et al. (1997). Heavens k, Taylor showed that their modes (abbreviating 
nlm = fi) are linearly related to the unredshifted overdensity modes 5^ by 

The matrix -\- j3V^y can be recognized as the representation in spherical 
transform space of the operator w{r)n{r)S^'^ , where S^*^ is the redshift 



80 



A. J. S. HAMILTON 



distortion operator (4.46) in the LG frame^^. Heavens &; Taylor introduced 
one further refinement, to model nonlinear ity as a random Gaussian velocity 
field (cf. §7.2), which has the effect of modifying the matrix + PV^^^ 
in equation (8.34) by pre-multiplying it by another matrix. The expected 
covariance matrix of the modes is then 

= {D^D;) = Y^i^^a + PVi_,a){^,a + PV,aTP{ka) + N^. (8.35) 
a 

where Nfj^^, is the Poisson sampling noise, which is just the (expected) self- 
galaxy contribution to (D^D*). In all. Heavens and Taylor's covariance 
matrix C^,y took into account finite sky coverage, linear redshift distor- 
tions, nonlinearity modelled as a random Gaussian velocity distribution, 
and Poisson sampling noise. 

The prior covariance C^^, equation (8.35), depends on the unrcdshiftcd 
power spectrum P{k) and on the distortion parameter p. Heavens & Taylor 
adopted as prior the shape of the power spectrum measured by Peacock Sz 
Dodds (1994), but retained its overall amplitude cTg and the distortion pa- 
rameter (3 as parameters to be measured from the likelihood analysis. They 
assumed a Gaussian likelihood function, equation (5.17), and included ob- 
served modes Dnem with kne < 0.1 hMpc~^, a total of 604 modes, with max- 
imum values £ = 17 and n = 6. The prior covariance C^^, equation (8.35), 
was computed including modes 6nim up to ^ = 30 and n = 20. The final re- 
sult of the likelihood analysis was P = l.lzbO.3 . The uncertainty here is the 
conditional la uncertainty, which comes from projecting the Aln£ = —0.5 
contour on to the parameter axes. Heavens (1997, private communication) 
states that a more conservative estimate of the uncertainty in /3 is the 
marginal uncertainty, ±0.5 (lu), which results from integrating the like- 
lihood over the probability distribution of the amplitude erg. Heavens &; 
Taylor commented that the maximum likelihood value of fS dropped to 0.5 
when the maximum kne was increased from 0.1 to 0.12 /iMpc^^, which in- 
creases the number of modes from 604 to 1048. They attributed the change 
in P to the onset of nonlinearity, but it seems likely that statistical fluctu- 
ation rather than nonlinearity would cause so dramatic change in P for so 
small a change in the cutoff wavenumber. 

^''Actually they used the distortion operator S appropriate to the CMB frame, equa- 
tion (4.23), rather than the distortion operator S'"'"' in the LG frame, equation (4.46), 
and they dropped the dipole (£ = 1) modes from the hkehhood analysis. This is not 
quite correct, although the error should be minor for a near all-sky survey. They used 
a selection function, their equation (54), from Fisher, private communication; whether 
this was a real or redshift selection function is unclear. If a redshift selection function, 
then strictly the distortion operator ^'^ relative to the redshift space selection function 
should be used, as discussed in §4.4.2. Again, Heavens & Taylor dropped the monopole 
{£ = 0) modes from the Hkelihood analysis, so whatever the case, any error should be 
minor for this near all-sky survey. 
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Ballinger, Heavens & Taylor (1995) took Heavens & Taylor's procedure 
a step further by parametrizing the power spectrum by its amphtude in six 
logarithmically spaced bins, rather than just an overall normalization. Oth- 
erwise the analysis proceeded essentially exactly as in Heavens &l Taylor, 
with a similar result, (3 = 1.04 ± 0.3. 

Tegmark, Taylor & Heavens (1997) report that the 604 modes used by 
Heavens & Taylor (1995) can be compressed to as few as 60 signal-to-noise 
eigenmodes (so called Karhunen-Loeve modes) of /3, without appreciable 
loss of accuracy in (3. This compression step, which involves solving an 
eigenvalue equation in the covariance matrix C^j^, does not obviate the 
necessity to compute the covariance matrix at full size, but it can reduce 
the work involved in computing likelihood contours for the parameters. 

8.2.8. Lin 

Lin (1995) examined redshift distortions, and measured the pairwise veloc- 
ity dispersion, in the Las Campanas Redshift Survey (LCRS) (Shectman 
et al. 1996), which contains 23697 galaxies in six narrow slices each 1?5 in 
declination and 80° in right ascension, three each in the north and south 
galactic caps. Lin measured (5 = 0.5 ±0.25 using Hamilton's (1992) estima- 
tor of the quadrupole-to-monopole power, equation (8.1). 

8.2.9. Baugh 

Baugh (1996) compared the real space correlation function ^(r) of the APM 
survey obtained by Fourier transforming the power spectrum derived by 
Baugh & Efstathiou (1993), to the redshift space correlation function ^*(r) 
of the Stronilo-APM survey from Loveday et al. (1995). Uncertainty in the 
APM real space power spectrum arises from uncertainty in cosmological 
geometry (i.e. in f^o), and in the evolution of the power spectrum with 
redshift, parametrized as P{k) oc (1 -|- z)^° at fixed comoving wavenumber 
k (so a = 2 for linear growth in an = 1 Universe). Baugh (his Table 2) 
quoted three values of /3, two assuming no evolution, a = and J7o = 1 
or 0.2, giving respectively (3 = 0.61^q23 or 0.451^q'23 , and one value with 
evolution, a = 2 and = 1, giving j3 = 0.201q 22 • Evidently the value of 
/? is quite sensitive to the assumed geometry and evolution. Observations 
suggest quite strong evolution. For example, Le Fevre et al. (1996) find 
from the Canada-France Redshift Survey (CFRS) an evolutionary index-^^ 
e = 0.4 lb 1.1, corresponding to a = 1.7 it 1.1. Table 8.1 in this review 
therefore lists Baugh's value fi = 0.20^o;22 with evolution. Presumably the 
uncertainty arising from evolution could be resolved in a manner similar 

^''Le Fevre et al. define evolution with respect to proper, not comoving coordinates: 
their index e is related to Baugh's index abyS-l-e = a + ^, where 7 « 1.7 is the 
logarithmic slope of the correlation function. 
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to Peacock (1997) — the APM power spectrum could be normalized so 
as to agree with the amphtude of the real space correlation function of 
Stromlo-APM deduced by Loveday et al. (1995). 

8.2.10. Loveday et al. 

Loveday et al. (1996a) examined the Stromlo-APM redshift survey (Love- 
day et al. 1996b), a l-in-20 survey of galaxies brighter hj = 17.15 in 
the Automatic Plate Measuring (APM) survey (Maddox et al. 1990a,b, 
1996). They considered both of the first two methods described in §5. 
They measured the real space correlation function ^(r) from the projected 
cross-correlation between the Stromlo-APM survey and its parent APM 
survey, using the procedure of Saunders, Rowan-Robinson &; Lawrence 
(1992), as described by Loveday et al. (1995). They measured the monopole, 
quadrupole, and hexadecapole harmonics ^|(r) of the redshift space corre- 
lation function using the procedure of Hamilton (1993b). 

They tested the different methods with mock catalogues constructed 
from two ensembles of AT-body simulations described by Croft k, Efstathiou 
(1994), 10 flat CDM simulations with a cosmological constant, VIm = 0.2 
and = 0.8, and 9 flat Mixed Dark Matter simulations, r^cDM = 0.7 
and 0.1, = 0.3 {v for neutrinos denotes Hot Dark Matter). Loveday et al. 
found from the simulations that nonlinearities affect the angle-averaged 
(monopole) correlation function E,^{t^ up to separations r ~ 10/i-^Mpc, 
its volume integral J*(r) = jo ^^{s)s'^ds to separations r ~ 20/i~^Mpc, 
and the quadrupole and hexadecapole harmonics CK'^) ^K'') to scales 
r ~ 30/i~^Mpc. They concluded that the first method, the ratio of red- 
shift to real space correlation functions (or its integrals) is likely to yield 
more reliable measures of f3. In practice the ratio J* (r) / J(r) of integrated 
correlation functions proved considerably less noisy than C*(r')/^(r), so the 
former was their statistic of choice. 

For the Stromlo-APM survey, Loveday et al. found /? = 0.48 ±0.12 from 
the ratio J^{r)/ J{r) at its most accurately measured point, a separation of 
r = 17.8 /i~^Mpc. This is also the point where the observed ratio J^{r)/J{r) 
reached its maximum value, so they also concluded more conservatively 
that the data favoured j3 ^ 0.6. They also considered Hamilton's (1992) 
quadrupole-to-monopole ratio, equation (8.1), but were unable to draw any 
conclusions from this estimator. The conclusion that nonlinearities have 
a large effect on the quadrupole-to-monopole ratio in the Stromlo-APM 
survey agrees with the analysis of the present review, §6. 

8.2.11. Tadros & Efstathiou 

Tadros & Efstathiou (1996) carried out an analysis similar to Baugh (1996), 
§8.2.9, except that they worked with the power spectrum instead of the cor- 
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relation function. Tadros &: Efstathiou computed the redshift space power 
spectrum (k) of the Stromlo-APM survey, and compared this to the real 
space power spectrum P{k) of the APM survey computed by Baugh & 
Efstathiou (1993). Table 8.1 quotes the 'Flux Limited' (misprinted 'Flux 
Weighted') result from Tadros & Efstathiou's Table 2, since this weighting 
is expected to (and in fact did) give lower variance than the 'Volume Lim- 
ited' weighting. As with Baugh (1996), systematic uncertainty in the real 
space APM power spectrum arises from uncertainty in its rate of evolution 
with redshift, and again Tadros & Efstathiou gave values both without evo- 
lution, P = 0.74 ± 0.48 for a = 0, and with evolution, /3 = 0.20 ± 0.44 for 
a = 1.3. Table 8.1 quotes the value with evolution. 

8.2.12. Ratcliffe et al. 

Ratcliffe et al. (1997) studied the Durham/UKST (UK Schmidt Telescope) 
redshift survey, which is a > 75% complete l-in-3 redshift survey of galaxies 
brighter than 6j 17 in the Edinburgh/Durham Southern Galaxy Cata- 
logue (Collins, Heydon-Dumbleton & MacGillivray 1988; Collins, Nichol 
&L Lumsden 1992). The survey contains ~ 2500 galaxies over a region 
~ 20° X 70° centred on the South Galactic Pole. 

Ratcliffe et al. used both of the first two methods described in §5. For the 
first method, they measured both the angle-averaged redshift correlation 
function C*(r) and the real space correlation function ^(r) from the survey, 
inferring the latter from the projected redshift correlation function using 
Saunders et al.^s (1992) procedure. The resulting ratio ^*(r)/^(r) of redshift 
to real correlation functions was quite noisy, so they considered instead the 
ratio J'^{r)/J{r) of the integrated correlation functions J(r) = JJ" ^(s)s^ds, 
finding (3 = 0.52 ± 0.39 at a separation r = 20/i~^Mpc. 

For the second method, Ratcliffe et al. used Hamilton's (1992) estimator 
of the quadrupole-to-monopole power, equation (8.1). Combining results 
over separations r « 10-20 /i~^Mpc gave /? = 0.48 ± 0.11. 

Ratcliffe et al. tested their procedures extensively with two sets of CDM 
N-hody simulations, one set an = 1 biased b = 1.6 model, the other 
a spatially flat = 0.2, U\ = 0.8 unbiased model with a cosmological 
constant. These simulations were not matched in every respect to the obser- 
vations — in particular the simulations had a considerably larger small scale 
velocity dispersion. On the assumption of an exponential pairwise velocity 
distribution, the 1-dimensional pairwise velocity dispersions were measured 
to be 980 ± 22 km s~^ and 835 it 60 km s~^ respectively for the Q,m = 1 and 
0.2 models, compared to 416 ± 36kms~^ for the Durham/UKST survey. 
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Figure 8. In this diagram, each wedge represents a cone of fixed opening angle, with 
the observer (us) at the point of the cone, at zero redshift. The wedges show the rela- 
tion between physical sizes, namely the comoving distances in the radial (vertical) and 
transverse (horizontal) directions, and observable quantities, namely redshift and angular 
separation, in three different cosmological models: (left) flat matter-dominated Universe; 
(middle) open matter-dominated Universe with Oo = 0.2; (right) flat Universe with a 
cosmological constant, = 0.8. 

9. Cosmological Redshift Distortions 

The relation between redshift and radial comoving distance, and between 
angle and transverse comoving distance, is different for different cosmolog- 
ical models, as illustrated in Figure 8. The differences produce a cosmolog- 
ical redshift distortion that is zero at zero redshift, but that becomes more 
marked at higher redshift. 

The idea of using cosmological redshift distortions to measure cosmolog- 
ical parameters, notably the cosmological constant, was first proposed by 
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Alcock & Paczynski (1979). More recently, several authors have considered 
the combined effect of distortions from peculiar velocities and from cosmol- 
ogy, both to see how cosmological distortion affects the measurement of /3, 
and to assess the prospect of measuring cosmological parameters from up- 
coming redshift surveys (Ballinger, Peacock k, Heavens 1996; Matsubara &; 
Suto 1996; Nakamura, Matsubara & Suto 1997; de Laix & Starkman 1997). 

Alcock & Paczynski (1979) argued, and subsequent authors have con- 
curred, that the largest cosmological distortion is a A-squashing, a squash- 
ing in the radial direction that occurs if there is a large cosmological con- 
stant. By comparison, the differences in redshift distortions between low 
and high Q.m models in the absence of a cosmological constant are rela- 
tively small. 

The differences between redshift distortions predicted by models with 
and without a cosmological constant at first increase with redshift, but then 
saturate at z ~ 1 for physically interesting models, those with ~ 0.1 
(Ballinger, Peacock & Heavens 1996; Matsubara & Suto 1996). Even at 
these redshifts, the cosmological distortion is not easy to distinguish from 
redshift distortions caused by peculiar velocities. This degeneracy could in 
principle be resolved because the cosmological and peculiar velocity signals 
evolve differently with redshift, but in practice the uncertain evolution of 
bias muddies the issue (Ballinger, Peacock & Heavens 1996). Nakamura, 
Matsubara k, Suto (1997) emphasize that cosmological redshift distortions 
will affect the linear distortion parameter (5 at the 10-20% level in the 
Sloan Digital Sky Survey (SDSS), which aims to go to a median depth 
of z ~ 0.1. dc Laix & Starkman (1997) conclude that the SDSS will not 
provide a clean signal of cosmological parameters from redshift distortions 
in the linear regime. 
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